Abstract

We study the properties of a simple greedy algorithm introduced in [8] for the generation of data-adapted anisotropic triangulations. Given a function $f$, the algorithm produces nested triangulations $\mathcal{T}_N$ and corresponding piecewise polynomial approximations $f_N$ of $f$. The refinement procedure picks the triangle which maximizes the local $L^p$ approximation error, and bisects it in a direction which is chosen so as to minimize this error at the next step. We study the approximation error in the $L^p$ norm when the algorithm is applied to $C^2$ functions with piecewise linear approximations. We prove that as the algorithm progresses, the triangles tend to adopt an optimal aspect ratio which is dictated by the local hessian of $f$. For convex functions, we also prove that the adaptive triangulations satisfy the convergence bound $\|f - f_N\|_{L^p} \le C N^{-1} \|\sqrt{|\det(d^2 f)|}\|_{L^\tau}$ with $\frac{1}{\tau} = \frac{1}{p} + 1$, which is known to be asymptotically optimal among all possible triangulations.

1 Introduction

In finite element approximation, a classical and important distinction is made between uniform and adaptive methods. In the first case all the elements which constitute the mesh have comparable shape and size, while these attributes are allowed to vary strongly in the second case. An important feature of adaptive methods is the fact that the mesh is not fixed in advance but rather tailored to the properties of the function $f$ to be approximated. Since the function approximating $f$ is not picked from a fixed linear space, adaptive finite elements can be considered as an instance of non-linear approximation. Other instances include approximation by rational functions, or by $N$-term linear combinations of a basis or dictionary. We refer to [9] for a general survey on non-linear approximation.

In this paper, we focus our interest on piecewise linear finite element functions defined over triangulations of a bidimensional polygonal domain $\Omega \subset \mathbb{R}^2$. Given a triangulation $\mathcal{T}$ we denote by $V_{\mathcal{T}} := \{v \ \text{s.t.}\ v_{|T} \in \Pi_1,\ T \in \mathcal{T}\}$ the associated finite element space. The norm in which we measure the approximation error is the $L^p$ norm for $1 \le p \le \infty$ and we therefore do not require that the triangulations are conforming and that the functions of $V_{\mathcal{T}}$ are continuous between triangles. For a given function $f$ we define

$$e_N(f)_{L^p} := \inf_{\#(\mathcal{T}) \le N} \ \inf_{g \in V_{\mathcal{T}}} \|f - g\|_{L^p},$$

the best approximation error of $f$ when using at most $N$ elements. In adaptive finite element approximation, critical questions are:

1. Given a function $f$ and a number $N > 0$, how can we characterize the optimal mesh for $f$ with $N$ elements corresponding to the above defined best approximation error?

2. What quantitative estimates are available for the best approximation error $e_N(f)_{L^p}$? Such estimates should involve the derivatives of $f$ in a different way than for non-adaptive meshes.

3. Can we build by a simple algorithmic procedure a mesh $\mathcal{T}_N$ of cardinality $N$ and a finite element function $f_N \in V_{\mathcal{T}_N}$ such that $\|f - f_N\|_{L^p}$ is comparable to $e_N(f)_{L^p}$?

While the optimal mesh is usually difficult to characterize exactly, it should satisfy two intuitively desirable features: (i) the triangulation should equidistribute the local approximation error between each triangle and (ii) the aspect ratio of a triangle $T$ should be isotropic with respect to a distorted metric induced by the local value of the hessian $d^2 f$ on $T$ (and therefore anisotropic in the sense of the euclidean metric). Under such prescriptions on the mesh, quantitative error estimates have recently been obtained in [7, 1] when $f$ is a $C^2$ function. These estimates are of the form

$$\limsup_{N \to +\infty} N e_N(f)_{L^p} \le C \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau}, \qquad \frac{1}{\tau} = \frac{1}{p} + 1, \tag{1.1}$$

where $\det(d^2 f)$ is the determinant of the $2 \times 2$ hessian matrix. For a convex $C^2$ function $f$ this estimate has been proved to be asymptotically optimal in [7], in the following sense

$$\liminf_{N \to +\infty} N e_N(f)_{L^p} \ge c \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau}. \tag{1.2}$$

The convexity assumption can actually be replaced by a mild assumption on the sequence of triangulations which is used for the approximation of $f$: a sequence $(\mathcal{T}_N)_{N \ge N_0}$ is said to be admissible if $\#(\mathcal{T}_N) \le N$ and

$$\sup_{N \ge N_0} \left( N^{1/2} \max_{T \in \mathcal{T}_N} \operatorname{diam}(T) \right) < \infty.$$

Then it is proved in [10] that for any admissible sequence and any $C^2$ function $f$, one has

$$\liminf_{N \to +\infty} N \inf_{g \in V_{\mathcal{T}_N}} \|f - g\|_{L^p} \ge c \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau}. \tag{1.3}$$

The admissibility assumption is not a severe limitation for an upper estimate of the error since it is also proved that for all $\varepsilon > 0$, there exists an admissible sequence such that

$$\limsup_{N \to +\infty} N \inf_{g \in V_{\mathcal{T}_N}} \|f - g\|_{L^p} \le c \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau} + \varepsilon. \tag{1.4}$$

We also refer to [10] for a generalization of such upper and lower estimates to higher order elements.

From the computational viewpoint, a commonly used strategy for designing an optimal mesh therefore consists in evaluating the hessian $d^2 f$ and imposing that each triangle of the mesh is isotropic with respect to a metric which is properly related to its local value. We refer in particular to [3] where this program is executed using Delaunay mesh generation techniques. While these algorithms quickly produce anisotropic meshes which are naturally adapted to the approximated function, they suffer from two intrinsic limitations:

1. They use the data of $d^2 f$, and therefore do not apply to non-smooth or noisy functions.

2. They are non-hierarchical: for $N > M$, the triangulation $\mathcal{T}_N$ is not a refinement of $\mathcal{T}_M$.

In [8], an alternate strategy was proposed for the design of adaptive hierarchical meshes, based on a simple greedy algorithm: starting from an initial triangulation $\mathcal{T}_{N_0}$, the algorithm picks the triangle $T \in \mathcal{T}_N$ with the largest local $L^p$ error. This triangle is then bisected from the mid-point of one of its edges to the opposite vertex. Among the three possible edges, the algorithm chooses the one that minimizes the new approximation error after bisection. The algorithm can be applied to any $L^p$ function, smooth or not, in the context of piecewise polynomial approximation of any given order. In the case of piecewise linear approximation, numerical experiments in [8] indicate that this elementary strategy generates triangles with an optimal aspect ratio and approximations $f_N \in V_{\mathcal{T}_N}$ such that $\|f - f_N\|_{L^p}$ satisfies the same estimate as $e_N(f)_{L^p}$ in (1.1).

The goal of this paper is to support these experimental observations by a rigorous analysis. Our paper 
is organized as follows: 

In §2, we introduce notations which are used throughout the paper and collect some available approximation theory results for piecewise linear finite elements, making the distinction between (i) uniform, (ii) adaptive isotropic and (iii) adaptive anisotropic triangulations. In the last case, which is in the scope of this paper, we introduce a measure of non-degeneracy of a triangle $T$ with respect to a quadratic form. We show that the optimal error estimate (1.1) is met when each triangle is non-degenerate in the sense of the above measure with respect to the quadratic form given by the local hessian $d^2 f$. We end by briefly recalling the greedy algorithm which was introduced in [8].

In §3, we study the behavior of the refinement procedure when applied to a quadratic function $q$ such that its associated quadratic form $\mathbf{q}$ is of positive or negative sign. A key observation is that the edge which is bisected is the longest with respect to the metric induced by $\mathbf{q}$. This allows us to prove that the triangles generated by the refinement procedure adopt an optimal aspect ratio in the sense of the non-degeneracy measure introduced in §2.

In §4, we study the behavior of the algorithm when applied to a general function $f$ which is assumed to be strictly convex (or strictly concave). We first establish a perturbation result, which shows that when $f$ is locally close to a quadratic function $q$ the algorithm behaves in a similar manner as when applied to $q$. We then prove that the diameters of the triangles produced by the algorithm tend to zero so that the perturbation result can be applied. This allows us to show that the optimal convergence estimate

$$\limsup_{N \to +\infty} N \|f - f_N\|_{L^p} \le C \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau} \tag{1.5}$$

is met by the sequence of approximations $f_N \in V_{\mathcal{T}_N}$ generated by the algorithm.

The extension of this result to an arbitrary $C^2$ function $f$ remains an open problem. It is possible to carry out an analysis similar to §3 in the case where the quadratic form $\mathbf{q}$ is of mixed sign, also proving that the triangles adopt an optimal aspect ratio as they get refined. We describe this analysis in §9.1 of [11]. However, it seems difficult to extend the perturbation analysis of §4 to this new setting. In particular the diameters of the triangles are no longer guaranteed to tend to zero, and one can even exhibit examples of non-convex $C^2$ functions $f$ for which the approximation $f_N$ fails to converge towards $f$ due to this phenomenon. Such examples are discussed in [8], which also proposes a modification of the algorithm for which convergence is always ensured. However, we do not know if the optimal convergence estimate (1.5) holds for any $f \in C^2$ with this modified algorithm, although this seems plausible from the numerical experiments.

2 Adaptive finite element approximation 
2.1 Notations 

We shall make use of a linear approximation operator $\mathcal{A}_T$ that maps continuous functions defined on $T$ onto $\Pi_1$. For an arbitrary but fixed $1 \le p \le \infty$, we define the local approximation error

$$e_T(f)_p := \|f - \mathcal{A}_T f\|_{L^p(T)}.$$

The critical assumptions in our analysis for the operator $\mathcal{A}_T$ will be the following:

1. $\mathcal{A}_T$ is continuous in the $L^\infty$ norm.

2. $\mathcal{A}_T$ commutes with affine changes of variables: $\mathcal{A}_T(f) \circ \phi = \mathcal{A}_{\phi^{-1}(T)}(f \circ \phi)$ for all affine $\phi$.

3. $\mathcal{A}_T$ reproduces $\Pi_1$: $\mathcal{A}_T(\pi) = \pi$, for any $\pi \in \Pi_1$.

Note that the commutation assumption implies that for any function $f$ and any affine transformation $\phi : x \mapsto x_0 + Lx$ we have

$$e_{\phi(T)}(f)_p = |\det(L)|^{1/p} \, e_T(f \circ \phi)_p. \tag{2.6}$$

Two particularly simple admissible choices of approximation operators are the following:

• $\mathcal{A}_T = P_T$, the $L^2(T)$-orthogonal projection operator: $\int_T (f - P_T f)\pi = 0$ for all $\pi \in \Pi_1$.

• $\mathcal{A}_T = I_T$, the local interpolation operator: $I_T f(v_i) = f(v_i)$ with $\{v_0, v_1, v_2\}$ the vertices of $T$.

All our results are simultaneously valid when $\mathcal{A}_T$ is either $P_T$ or $I_T$, or any linear operator that fulfills the three above assumptions.

Given a function $f$ and a triangulation $\mathcal{T}_N$ with $N = \#(\mathcal{T}_N)$, we can associate a finite element approximation $f_N$ defined on each $T \in \mathcal{T}_N$ by $f_N(x) = \mathcal{A}_T f(x)$. The global approximation error is given by

$$\|f - f_N\|_{L^p} = \Big( \sum_{T \in \mathcal{T}_N} e_T(f)_p^p \Big)^{1/p},$$

with the usual modification when $p = \infty$.

Remark 2.1 The operator $B_T$ of best $L^p(T)$ approximation which is defined by

$$\|f - B_T f\|_{L^p(T)} = \min_{\pi \in \Pi_1} \|f - \pi\|_{L^p(T)},$$

does not fall in the above category of operators, since it is non-linear (and not easy to compute) when $p \ne 2$. However, it is clear that any estimate on $\|f - f_N\|_{L^p}$ with $f_N$ defined as $\mathcal{A}_T f$ on each $T$ implies a similar estimate when $f_N$ is defined as $B_T f$ on each $T$.

Here and throughout the paper, when

$$q(x, y) = a_{2,0} x^2 + 2 a_{1,1} xy + a_{0,2} y^2 + a_{1,0} x + a_{0,1} y + a_{0,0}$$

we denote by $\mathbf{q}$ the associated quadratic form: if $u = (x, y)$

$$\mathbf{q}(u) = a_{2,0} x^2 + 2 a_{1,1} xy + a_{0,2} y^2.$$

Note that $\mathbf{q}(u) = \langle Qu, u \rangle$ where $Q = \begin{pmatrix} a_{2,0} & a_{1,1} \\ a_{1,1} & a_{0,2} \end{pmatrix}$. We define

$$\det(\mathbf{q}) := \det(Q).$$

If $\mathbf{q}$ is a positive or negative quadratic form, we define the $\mathbf{q}$-metric

$$|v|_{\mathbf{q}} := \sqrt{|\mathbf{q}(v)|} \tag{2.7}$$

which coincides with the euclidean norm when $\mathbf{q}(v) = x^2 + y^2$ for $v = (x, y)$. If $\mathbf{q}$ is a quadratic form of mixed sign, we define the associated positive form $|\mathbf{q}|$ which corresponds to the symmetric matrix $|Q|$ that has the same eigenvectors as $Q$ with eigenvalues $(|\lambda|, |\mu|)$ if $(\lambda, \mu)$ are the eigenvalues of $Q$. Note that generally $|\mathbf{q}|(u) \ne |\mathbf{q}(u)|$ and that one always has $|\mathbf{q}(u)| \le |\mathbf{q}|(u)$.

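Remark 2.2 If $\det Q > 0$, then there exists a $2 \times 2$ matrix $L$ and $\varepsilon \in \{+1, -1\}$ such that

$$L^{\mathrm{T}} Q L = \varepsilon \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}.$$

The linear change of coordinates $\phi(u) := Lu$, where $u = (x, y) \in \mathbb{R}^2$, therefore satisfies $\mathbf{q} \circ \phi(u) = \varepsilon(x^2 + y^2)$. On the other hand, if $\det Q < 0$ then there exists a $2 \times 2$ matrix $L$ such that

$$L^{\mathrm{T}} Q L = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

Defining again $\phi(u) := Lu$ we obtain in this case $\mathbf{q} \circ \phi(u) = x^2 - y^2$.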
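
The reduction of Remark 2.2 can be made concrete with a small sketch. The construction below is one possible choice of $L$ (scaled orthonormal eigenvectors of $Q$), not necessarily the one the authors have in mind; the function names `reducing_matrix` and `congruence` are illustrative and do not come from the paper.

```python
import math

def reducing_matrix(a, b, c):
    """Return L = ((l11, l12), (l21, l22)) such that L^T Q L is diagonal with
    entries +1 or -1, where Q = [[a, b], [b, c]] is symmetric with det(Q) != 0."""
    if a * c - b * b == 0:
        raise ValueError("det(Q) must be nonzero")
    if b == 0:
        # Q is already diagonal: the coordinate axes are eigenvectors.
        eig = [(a, (1.0, 0.0)), (c, (0.0, 1.0))]
    else:
        disc = math.sqrt((a - c) ** 2 + 4 * b * b)
        eig = []
        for lam in ((a + c + disc) / 2, (a + c - disc) / 2):
            vx, vy = b, lam - a          # solves (Q - lam I) v = 0
            norm = math.hypot(vx, vy)
            eig.append((lam, (vx / norm, vy / norm)))
    # Largest eigenvalue first, so that the mixed-sign case yields diag(1, -1).
    eig.sort(key=lambda pair: -pair[0])
    cols = [(vx / math.sqrt(abs(lam)), vy / math.sqrt(abs(lam)))
            for lam, (vx, vy) in eig]
    return ((cols[0][0], cols[1][0]), (cols[0][1], cols[1][1]))

def congruence(L, a, b, c):
    """Compute the 2x2 matrix L^T Q L for Q = [[a, b], [b, c]]."""
    (l11, l12), (l21, l22) = L
    m11, m12 = a * l11 + b * l21, a * l12 + b * l22   # first row of Q L
    m21, m22 = b * l11 + c * l21, b * l12 + c * l22   # second row of Q L
    return ((l11 * m11 + l21 * m21, l11 * m12 + l21 * m22),
            (l12 * m11 + l22 * m21, l12 * m12 + l22 * m22))
```

Calling `congruence(reducing_matrix(a, b, c), a, b, c)` should return, up to rounding, $\varepsilon$ times the identity when $\det Q > 0$ and $\operatorname{diag}(1, -1)$ when $\det Q < 0$.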
2.2 From uniform to adaptive isotropic triangulations

A standard estimate in finite element approximation states that if $f \in W^{2,p}(\Omega)$ then

$$\inf_{g \in V_h} \|f - g\|_{L^p} \le C h^2 \|d^2 f\|_{L^p},$$

where $V_h$ is the piecewise linear finite element space associated with a triangulation $\mathcal{T}_h$ of mesh size $h := \max_{T \in \mathcal{T}_h} \operatorname{diam}(T)$. If we restrict our attention to uniform triangulations, we have

$$N := \#(\mathcal{T}_h) \sim h^{-2}.$$

Therefore, denoting by $e_N^{\mathrm{unif}}(f)_{L^p}$ the $L^p$ approximation error by a uniform triangulation of cardinality $N$, we can re-express the above estimate as

$$e_N^{\mathrm{unif}}(f)_{L^p} \le C N^{-1} \|d^2 f\|_{L^p}. \tag{2.8}$$

This estimate can be significantly improved when using adaptive partitions. We give here some heuristic arguments, which are based on the assumption that on each triangle $T$ the relative variation of $d^2 f$ is small so that it can be considered as constant over $T$ (which means that $f$ is replaced by a quadratic function on each $T$), and we also indicate the available results which are proved more rigorously.

First consider isotropic triangulations, i.e. such that all triangles satisfy a uniform estimate

$$\rho_T := \frac{h_T}{r_T} \le A, \tag{2.9}$$

where $h_T := \operatorname{diam}(T)$ denotes the size of the longest edge of $T$, and $r_T$ is the radius of the largest disc contained in $T$. In such a case we start from the local approximation estimate on any $T$

$$e_T(f)_p \le C h_T^2 \|d^2 f\|_{L^p(T)},$$

and notice that

$$h_T^2 \|d^2 f\|_{L^p(T)} \sim |T| \, \|d^2 f\|_{L^p(T)} = \|d^2 f\|_{L^\tau(T)},$$

with $\frac{1}{\tau} := \frac{1}{p} + 1$ and $|T|$ the area of $T$, where we have used the isotropy assumption (2.9) in the equivalence and the fact that $d^2 f$ is constant over $T$ in the equality. It follows that

$$e_T(f)_p \le C \|d^2 f\|_{L^\tau(T)}, \qquad \frac{1}{\tau} := \frac{1}{p} + 1.$$

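Assume now that we can construct adaptive isotropic triangulations $\mathcal{T}_N$ with $N := \#(\mathcal{T}_N)$ which equidistribute the local error in the sense that for some prescribed $\varepsilon > 0$

$$c\varepsilon \le e_T(f)_p \le \varepsilon, \tag{2.10}$$

with $c > 0$ a fixed constant independent of $T$ and $N$. Then defining $f_N$ as $\mathcal{A}_T(f)$ on each $T \in \mathcal{T}_N$, we have on the one hand

$$\|f - f_N\|_{L^p} \le N^{1/p} \varepsilon,$$

and on the other hand, with $\frac{1}{\tau} := \frac{1}{p} + 1$,

$$N (c\varepsilon)^\tau \le \sum_{T \in \mathcal{T}_N} \|f - f_N\|_{L^p(T)}^\tau \le C^\tau \sum_{T \in \mathcal{T}_N} \|d^2 f\|_{L^\tau(T)}^\tau = C^\tau \|d^2 f\|_{L^\tau}^\tau.$$

Combining both, one obtains for $e_N^{\mathrm{iso}}(f)_{L^p} := \|f - f_N\|_{L^p}$ the estimate

$$e_N^{\mathrm{iso}}(f)_{L^p} \le C N^{-1} \|d^2 f\|_{L^\tau}. \tag{2.11}$$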
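
For the reader's convenience, the combination step can be spelled out as follows. This is a sketch under the same heuristic assumptions as above (equidistribution (2.10) and the local estimate in $L^\tau(T)$), with constants tracked only up to the factor $C/c$.

```latex
% From the lower bound in (2.10), summed over the N triangles:
N (c\varepsilon)^{\tau} \le C^{\tau} \|d^2 f\|_{L^\tau}^{\tau}
\quad\Longrightarrow\quad
\varepsilon \le \frac{C}{c}\, N^{-1/\tau}\, \|d^2 f\|_{L^\tau}.
% Inserting this into the upper bound and using 1/\tau = 1/p + 1:
\|f - f_N\|_{L^p} \le N^{1/p} \varepsilon
\le \frac{C}{c}\, N^{\frac{1}{p} - \frac{1}{\tau}}\, \|d^2 f\|_{L^\tau}
= \frac{C}{c}\, N^{-1}\, \|d^2 f\|_{L^\tau}.
```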

This estimate improves upon (2.8) since the rate $N^{-1}$ is now obtained with the weaker smoothness condition $d^2 f \in L^\tau$ and since, even for smooth $f$, the quantity $\|d^2 f\|_{L^\tau}$ might be significantly smaller than $\|d^2 f\|_{L^p}$. This type of result is classical in non-linear approximation and also occurs when we consider best $N$-term approximation in a wavelet basis.

The principle of error equidistribution suggests a simple greedy algorithm to build an adaptive isotropic triangulation for a given $f$, similar to our algorithm but where the bisection of the triangle $T$ that maximizes the local error $e_T(f)_p$ is systematically done from its most recently created vertex in order to preserve the estimate (2.9). Such an algorithm cannot exactly equilibrate the error in the sense of (2.10) and therefore does not lead to the same optimal estimate as in (2.11). However, it was proved in [2] that it satisfies

$$\|f - f_N\|_{L^p} \le C |f|_{B^2_{\tau,\tau}} N^{-1},$$

for all $\tau$ such that $\frac{1}{\tau} < \frac{1}{p} + 1$, provided that the local approximation operator $\mathcal{A}_T$ is bounded in the $L^p$ norm. Here $B^2_{\tau,\tau}$ denotes the usual Besov space which is a natural substitute for $W^{2,\tau}$ when $\tau < 1$. Therefore this estimate is not far from (2.11).



2.3 Anisotropic triangulations: the optimal aspect ratio 

We now turn to anisotropic adaptive triangulations, and start by discussing the optimal shape of a triangle $T$ for a given function $f$ at a given point. For this purpose, we again replace $f$ by a quadratic function, assuming that $d^2 f$ is constant over $T$. For such a $q \in \Pi_2$ and its associated quadratic form $\mathbf{q}$, we first derive an equivalent quantity for the local approximation error. Here as well as in §3 and §4, we consider a triangle $T$ and we denote by $(a, b, c)$ its edge vectors oriented in clockwise or anti-clockwise direction so that

$$a + b + c = 0.$$

Proposition 2.3 The local $L^p$-approximation error satisfies

$$e_T(q)_p = e_T(\mathbf{q})_p \sim |T|^{1/p} \max\{|\mathbf{q}(a)|, |\mathbf{q}(b)|, |\mathbf{q}(c)|\},$$

where the constant in the equivalence is independent of $q$, $T$ and $p$.

Proof: The first equality is trivial since $q$ and $\mathbf{q}$ differ by an affine function. Let $T_{\mathrm{eq}}$ be an equilateral triangle of area $|T_{\mathrm{eq}}| = 1$, and edges $a, b, c$. Let $E$ be the 3-dimensional vector space of all quadratic forms. Then the following quantities are norms on $E$, and thus equivalent:

$$e_{T_{\mathrm{eq}}}(\mathbf{q})_p \sim \max\{|\mathbf{q}(a)|, |\mathbf{q}(b)|, |\mathbf{q}(c)|\}. \tag{2.12}$$

Note that the constants in this equivalence are independent of $p$ since all $L^p(T_{\mathrm{eq}})$ norms are uniformly equivalent on $E$.

If $T$ is an arbitrary triangle, there exists an affine transform $\phi : x \mapsto x_0 + Lx$ such that $T = \phi(T_{\mathrm{eq}})$. For any quadratic function $q$, we thus obtain from (2.6)

$$e_T(\mathbf{q})_p = e_T(q)_p = e_{\phi(T_{\mathrm{eq}})}(q)_p = |\det L|^{1/p} e_{T_{\mathrm{eq}}}(q \circ \phi)_p = |\det L|^{1/p} e_{T_{\mathrm{eq}}}(\mathbf{q} \circ L)_p$$

since $\mathbf{q} \circ L$ is the homogeneous part of $q \circ \phi$. By (2.12), we thus have

$$e_T(\mathbf{q})_p \sim |\det L|^{1/p} \max\{|\mathbf{q}(La)|, |\mathbf{q}(Lb)|, |\mathbf{q}(Lc)|\},$$

where $\{a, b, c\}$ are again the edge vectors of $T_{\mathrm{eq}}$. Remarking that $|T| = |\det L|$ and that $\{La, Lb, Lc\}$ are the edge vectors of $T$, this concludes the proof of this proposition. $\diamond$



In order to describe the optimal shape of a triangle $T$ for the quadratic function $q$, we fix the area $|T|$ and try to minimize the error $e_T(q)_p$ or equivalently $\max\{|\mathbf{q}(a)|, |\mathbf{q}(b)|, |\mathbf{q}(c)|\}$. The solution to this problem can be found by introducing for any $\mathbf{q}$ such that $\det(\mathbf{q}) \ne 0$ the following measure of non-degeneracy for $T$:

$$\rho_{\mathbf{q}}(T) := \frac{\max\{|\mathbf{q}(a)|, |\mathbf{q}(b)|, |\mathbf{q}(c)|\}}{|T| \sqrt{|\det(\mathbf{q})|}}. \tag{2.13}$$

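Let $\phi$ be a linear change of variables, $\mathbf{q}$ a quadratic form and $T$ a triangle of edges $a, b, c$. Then $\det(\mathbf{q} \circ \phi) = (\det \phi)^2 \det(\mathbf{q})$, the edges of $\phi(T)$ are $\phi(a), \phi(b), \phi(c)$ and $|\phi(T)| = |\det \phi| \, |T|$. Hence we obtain

$$\rho_{\mathbf{q} \circ \phi}(T) = \frac{\max\{|\mathbf{q} \circ \phi(a)|, |\mathbf{q} \circ \phi(b)|, |\mathbf{q} \circ \phi(c)|\}}{|T| \sqrt{|\det(\mathbf{q} \circ \phi)|}} = \frac{\max\{|\mathbf{q}(\phi(a))|, |\mathbf{q}(\phi(b))|, |\mathbf{q}(\phi(c))|\}}{|\det \phi| \, |T| \sqrt{|\det \mathbf{q}|}} = \rho_{\mathbf{q}}(\phi(T)). \tag{2.14}$$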
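
As an illustration, the measure of non-degeneracy and its invariance property can be evaluated numerically. The sketch below assumes triangles are given as triples of vertex coordinates and quadratic forms by their symmetric $2 \times 2$ matrix; the helper names (`form`, `rho`, `pull_back`, `image`) are hypothetical and chosen for this example only.

```python
import math

def form(Q, u):
    """Evaluate the quadratic form q(u) = <Q u, u> for Q = ((a, b), (b, c))."""
    (a, b), (_, c) = Q
    x, y = u
    return a * x * x + 2 * b * x * y + c * y * y

def edge_vectors(T):
    """Edge vectors of the triangle T = (p0, p1, p2); they sum to zero."""
    return [(T[(i + 1) % 3][0] - T[i][0], T[(i + 1) % 3][1] - T[i][1])
            for i in range(3)]

def area(T):
    (x0, y0), (x1, y1), (x2, y2) = T
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2

def rho(Q, T):
    """Non-degeneracy measure of T with respect to the form of matrix Q."""
    (a, b), (_, c) = Q
    det = a * c - b * b
    worst = max(abs(form(Q, e)) for e in edge_vectors(T))
    return worst / (area(T) * math.sqrt(abs(det)))

def pull_back(Q, L):
    """Matrix L^T Q L of the form q o phi, for the linear map phi(u) = L u."""
    (a, b), (_, c) = Q
    (l11, l12), (l21, l22) = L
    m11, m12 = a * l11 + b * l21, a * l12 + b * l22
    m21, m22 = b * l11 + c * l21, b * l12 + c * l22
    return ((l11 * m11 + l21 * m21, l11 * m12 + l21 * m22),
            (l12 * m11 + l22 * m21, l12 * m12 + l22 * m22))

def image(L, T):
    """Image of the triangle T under the linear map u -> L u."""
    (l11, l12), (l21, l22) = L
    return tuple((l11 * x + l12 * y, l21 * x + l22 * y) for x, y in T)

equilateral = ((0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2))
half_square = ((0.0, 0.0), (1.0, 0.0), (0.0, 1.0))
```

With these helpers, `rho(pull_back(Q, L), T)` and `rho(Q, image(L, T))` should agree up to rounding for any invertible `L`, which is the content of the invariance formula.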

The last equation, combined with Remark 2.2, allows us to reduce the study of $\rho_{\mathbf{q}}(T)$ to two elementary cases by change of variable:

1. The case where $\det(\mathbf{q}) > 0$ is reduced to $\mathbf{q}(x, y) = x^2 + y^2$. Recall that for any triangle $T$ with edges $a, b, c$ we define $h_T := \operatorname{diam}(T) = \max\{|a|, |b|, |c|\}$, with $|\cdot|$ the euclidean norm. In this case we therefore have $\rho_{\mathbf{q}}(T) = \frac{h_T^2}{|T|}$, which corresponds to a standard measure of shape regularity in the sense that its boundedness is equivalent to a property such as (2.9). This quantity is minimized when the triangle $T$ is equilateral, with minimal value $\frac{4}{\sqrt{3}}$ (in fact it was also proved in [5] that the minimum of the interpolation error $\|q - I_T q\|_{L^p(T)}$ among all triangles of area $|T| = 1$ is attained when $T$ is equilateral). For a general quadratic form $\mathbf{q}$ of positive sign, we obtain by change of variable that the minimal value $\frac{4}{\sqrt{3}}$ is obtained for triangles which are equilateral with respect to the metric $|\cdot|_{\mathbf{q}}$. More generally triangles with a good aspect ratio, i.e. a small value of $\rho_{\mathbf{q}}(T)$, are those which are isotropic with respect to this metric. Of course, a similar conclusion holds for a quadratic form of negative sign.

2. The case where $\det(\mathbf{q}) < 0$ is reduced to $\mathbf{q}(x, y) = x^2 - y^2$. In this case, the analysis presented in [4] shows that the quantity $\rho_{\mathbf{q}}(T)$ is minimized when $T$ is a half of a square with sides parallel to the $x$ and $y$ axes, with minimal value 2. But using (2.14) we also notice that $\rho_{\mathbf{q}}(T) = \rho_{\mathbf{q}}(L(T))$ for any linear transformation $L$ such that $\mathbf{q} = \mathbf{q} \circ L$. This holds if $L$ has eigenvalues $(\lambda, \frac{1}{\lambda})$, where $\lambda \ne 0$, and eigenvectors $(1, 1)$ and $(-1, 1)$. Therefore, all images of the half square by such transformations $L$ are also optimal triangles. Note that such triangles can be highly anisotropic. For a general quadratic form $\mathbf{q}$ of mixed sign, we notice that $\rho_{\mathbf{q}}(T) \le \rho_{|\mathbf{q}|}(T)$, and therefore triangles which are equilateral with respect to the metric $|\cdot|_{|\mathbf{q}|}$ have a good aspect ratio, i.e. a small value of $\rho_{\mathbf{q}}(T)$. In addition, by similar arguments, we find that all images of such triangles by linear transforms $L$ with eigenvalues $(\lambda, \frac{1}{\lambda})$ and eigenvectors $(u, v)$ such that $\mathbf{q}(u) = \mathbf{q}(v) = 0$ also have a good aspect ratio, since $\mathbf{q} = \mathbf{q} \circ L$ for such transforms.

We leave aside the special case where det(q) = 0. In such a case, the triangles minimizing the error for a 
given area degenerate in the sense that they should be infinitely long and thin, aligned with the direction 
of the null eigenvalue of q. 

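Summing up, we find that triangles with a good aspect ratio are characterized by the fact that $\rho_{\mathbf{q}}(T)$ is small. In addition, from Proposition 2.3 and the definition of $\rho_{\mathbf{q}}(T)$, we have

$$e_T(q)_p \sim |T|^{1 + \frac{1}{p}} \sqrt{|\det \mathbf{q}|} \, \rho_{\mathbf{q}}(T) = \left\|\sqrt{|\det \mathbf{q}|}\right\|_{L^\tau(T)} \rho_{\mathbf{q}}(T), \qquad \frac{1}{\tau} := \frac{1}{p} + 1. \tag{2.15}$$

We now return to a function $f$ such that $d^2 f$ is assumed to be constant on every $T \in \mathcal{T}_N$. Assuming that all triangles have a good aspect ratio in the sense that

$$\rho_{\mathbf{q}}(T) \le C$$

for some fixed constant $C$ and with $\mathbf{q}$ the value of $d^2 f$ over $T$, we find up to a change in $C$ that

$$e_T(f)_p \le C \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau(T)}. \tag{2.16}$$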

By a similar reasoning as with isotropic triangulations, we now obtain that if the triangulation equidistributes the error in the sense of (2.10), then

$$\|f - f_N\|_{L^p} \le C N^{-1} \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau}, \tag{2.17}$$

and therefore (1.1) holds. This estimate improves upon (2.11) since the quantity $\|\sqrt{|\det(d^2 f)|}\|_{L^\tau}$ might be significantly smaller than $\|d^2 f\|_{L^\tau}$, in particular when $f$ has some anisotropic features, such as sharp gradients along curved edges.

The above derivation of (1.1) is heuristic and non-rigorous. Clearly, this estimate cannot be valid as such since $\det(d^2 f)$ may vanish while the approximation error does not (consider for instance $f$ depending only on a single variable). More rigorous versions were derived in [7] and [1]. In these results $d^2 f$ is typically replaced by a majorant $|d^2 f| + \varepsilon I$, avoiding that its determinant vanishes. The estimate (1.1) can then be rigorously proved but holds only for $N \ge N(\varepsilon, f)$ large enough. This limitation is unavoidable and reflects the fact that enough resolution is needed so that the hessian can be viewed as locally constant over each optimized triangle. Another formulation, which is rigorously proved in [10], reads as follows.

Proposition 2.4 There exists an absolute constant $C > 0$ such that for any polygonal domain $\Omega$ and any function $f \in C^2(\Omega)$, one has

$$\limsup_{N \to +\infty} N e_N(f)_{L^p} \le C \left\|\sqrt{|\det(d^2 f)|}\right\|_{L^\tau}.$$

2.4 The greedy algorithm

Given a target function $f$, our algorithm iteratively builds triangulations $\mathcal{T}_N$ with $N = \#(\mathcal{T}_N)$ and finite element approximations $f_N$. The starting point is a coarse triangulation $\mathcal{T}_{N_0}$. Given $\mathcal{T}_N$, the algorithm selects the triangle $T$ which maximizes the local error $e_T(f)_p$ among all triangles of $\mathcal{T}_N$, and bisects it from the mid-point of one of its edges towards the opposite vertex. This gives the new triangulation $\mathcal{T}_{N+1}$.

The critical part of the algorithm lies in the choice of the edge $e \in \{a, b, c\}$ from which $T$ is bisected. Denoting by $T_e^1$ and $T_e^2$ the two resulting triangles, we choose $e$ as the minimizer of a decision function $d_T(e, f)$, whose role is to drive the generated triangles towards an optimal aspect ratio. While the most natural choice for $d_T(e, f)$ corresponds to the split that minimizes the error after bisection, namely

$$d_T(e, f) = e_{T_e^1}(f)_p^p + e_{T_e^2}(f)_p^p,$$

we shall instead focus our attention on a decision function which is defined as the $L^1$ norm of the interpolation error

$$d_T(e, f) := \|f - I_{T_e^1} f\|_{L^1(T_e^1)} + \|f - I_{T_e^2} f\|_{L^1(T_e^2)}. \tag{2.18}$$

For this decision function, the analysis of the algorithm is made simpler, due to the fact that we can derive explicit expressions of $\|f - I_T f\|_{L^1(T)}$ when $f = q$ is a quadratic polynomial with a positive homogeneous part $\mathbf{q}$. We prove in §3 that this choice leads to triangles with an optimal aspect ratio in the sense of a small $\rho_{\mathbf{q}}(T)$. This leads us in §4 to a proof that the algorithm satisfies the optimal convergence estimate (2.17) in the case where $f$ is $C^2$ and strictly convex.
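
The refinement loop described above can be sketched in a few lines. The code below is a minimal illustration, not the authors' implementation: it assumes $p = 1$ and $\mathcal{A}_T = I_T$ for the selection step, and it approximates the $L^1$ interpolation error with the edge-midpoint quadrature rule, which is exact only when $f$ is a convex (or concave) quadratic polynomial. All function names are hypothetical.

```python
def area(T):
    (x0, y0), (x1, y1), (x2, y2) = T
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2

def midpoint(p, q):
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)

def interp_error_l1(f, T):
    """Edge-midpoint quadrature of |f - I_T f| over T (assumption: exact only
    when f is a convex or concave quadratic polynomial)."""
    total = 0.0
    for i in range(3):
        p, q = T[i], T[(i + 1) % 3]
        m = midpoint(p, q)
        # I_T f is affine, so its value at an edge midpoint is the mean
        # of the two endpoint values.
        total += abs(f(*m) - (f(*p) + f(*q)) / 2)
    return area(T) / 3 * total

def bisect(T, i):
    """Split T by joining vertex T[i] to the midpoint of the opposite edge."""
    v, p, q = T[i], T[(i + 1) % 3], T[(i + 2) % 3]
    m = midpoint(p, q)
    return (v, p, m), (v, m, q)

def greedy_refine(f, triangles, steps):
    """Run `steps` greedy bisections and return the refined list of triangles."""
    tris = list(triangles)
    for _ in range(steps):
        # Select the triangle with the largest local error (here p = 1).
        k = max(range(len(tris)), key=lambda j: interp_error_l1(f, tris[j]))
        T = tris.pop(k)
        # Decision function: keep the bisection whose two children have the
        # smallest total L^1 interpolation error.
        children = min((bisect(T, i) for i in range(3)),
                       key=lambda pair: sum(interp_error_l1(f, c) for c in pair))
        tris.extend(children)
    return tris
```

For a general $f$ one would replace `interp_error_l1` by a more accurate quadrature and, for $p \ne 1$, use a separate routine for the selection step; the structure of the loop stays the same.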

Remark 2.5 It should be well understood that while the decision function is based on the $L^1$ norm, the selection of the triangle to be bisected is done by maximizing $e_T(f)_p$. The algorithm remains therefore governed by the $L^p$ norm in which we wish to minimize the error $\|f - f_N\|_p$ for a given number of triangles. Intuitively, this means that the $L^p$-norm influences the size of the triangles which have to equidistribute the error, but not their optimal shape.

Remark 2.6 It was pointed out to us that the $L^1$ norm of the interpolation error to a suitable convex function is also used to improve the mesh in the context of moving grid techniques, see [6].

We define a variant of the decision function as follows

$$D_T(e, f) := \|f - I_T f\|_{L^1(T)} - d_T(e, f).$$

Note that $D_T(e, f)$ is the reduction of the $L^1$ interpolation error resulting from the bisection of the edge $e$, and that the selected edge that minimizes $d_T(\cdot, f)$ is also the one that maximizes $D_T(\cdot, f)$. The function $D_T$ has a simple expression in the case where $f$ is a convex function.

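Lemma 2.7 Let $T$ be a triangle and let $f$ be a convex function on $T$. Let $e$ be an edge of $T$ with endpoints $z_0$ and $z_1$. Then

$$D_T(e, f) = \frac{|T|}{3} \left( \frac{f(z_0) + f(z_1)}{2} - f\left(\frac{z_0 + z_1}{2}\right) \right). \tag{2.19}$$

If in addition $f$ has $C^2$ smoothness, we also have

$$D_T(e, f) = \frac{|T|}{6} \int_0^1 \langle d^2 f(z_t) e, e \rangle \min\{t, 1 - t\} \, dt, \quad \text{where } z_t := (1 - t) z_0 + t z_1. \tag{2.20}$$

Proof: Since $f$ is convex, we have $I_T f \ge f$ on $T$, hence

$$\|f - I_T f\|_{L^1(T)} = \int_T (I_T f - f).$$

Similarly, on the two triangles $T_e^1$ and $T_e^2$ obtained by bisecting $e$, we have $I_{T_e^1} f \ge f$ on $T_e^1$ and $I_{T_e^2} f \ge f$ on $T_e^2$, hence

$$D_T(e, f) = \int_T I_T f - \int_{T_e^1} I_{T_e^1} f - \int_{T_e^2} I_{T_e^2} f.$$

Let $z_2$ be the vertex of $T$ opposite the edge $e$. Since the function $f$ is convex, it follows from the previous expression that $D_T(e, f)$ is the volume of the tetrahedron of vertices

$$\left( \frac{z_{0,x} + z_{1,x}}{2}, \frac{z_{0,y} + z_{1,y}}{2}, f\left(\frac{z_0 + z_1}{2}\right) \right) \quad \text{and} \quad (z_{i,x}, z_{i,y}, f(z_i)) \ \text{for } i = 0, 1, 2,$$

where $(z_{i,x}, z_{i,y})$ are the coordinates of $z_i$. Let $u = z_0 - z_2$ and $v = z_1 - z_2$. We thus have $D_T(e, f) = \frac{1}{6} |\det(M)|$ where

$$M := \begin{pmatrix} u_x & v_x & \frac{u_x + v_x}{2} \\ u_y & v_y & \frac{u_y + v_y}{2} \\ f(z_0) - f(z_2) & f(z_1) - f(z_2) & f\left(\frac{z_0 + z_1}{2}\right) - f(z_2) \end{pmatrix}.$$

Subtracting half of the first two columns from the third one, we find that $M$ has the same determinant as

$$\begin{pmatrix} u_x & v_x & 0 \\ u_y & v_y & 0 \\ f(z_0) - f(z_2) & f(z_1) - f(z_2) & f\left(\frac{z_0 + z_1}{2}\right) - \frac{f(z_0) + f(z_1)}{2} \end{pmatrix}.$$

Recalling that $2|T| = |\det(u, v)|$ we therefore obtain (2.19). In order to establish (2.20), we observe that we have in the distribution sense $\partial_t^2 (\min\{t, 1 - t\}_+) = \delta_0 - 2\delta_{1/2} + \delta_1$, where $\delta_t$ is the one-dimensional Dirac function at a point $t$. Hence for any univariate function $h \in C^2([0, 1])$, we have

$$\int_0^1 h''(t) \min\{t, 1 - t\} \, dt = h(0) - 2h(1/2) + h(1).$$

Applying this identity to $h(t) := f(z_t)$, for which $h''(t) = \langle d^2 f(z_t) e, e \rangle$, and combining the result with (2.19), we obtain (2.20). $\diamond$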



3 Positive quadratic functions 

In this section, we study the algorithm when applied to a quadratic polynomial $q$ such that $\det(\mathbf{q}) > 0$. We shall assume without loss of generality that $\mathbf{q}$ is positive definite, since all our results extend in a trivial manner to the negative definite case.

Our first observation is that the refinement procedure based on the decision function (2.18) always selects for bisection the longest edge in the sense of the $\mathbf{q}$-metric $|\cdot|_{\mathbf{q}}$ defined by (2.7).

Lemma 3.1 An edge $e$ of $T$ maximizes $D_T(e, q)$ among all edges of $T$ if and only if it maximizes $|e|_{\mathbf{q}}$ among all edges of $T$.

Proof: The hessian $d^2 q$ is constant and for all $e \in \mathbb{R}^2$ one has

$$\langle d^2 q \, e, e \rangle = 2\mathbf{q}(e).$$

If $e$ is an edge of a triangle $T$, and if $q$ is a convex quadratic function, equation (2.20) therefore gives

$$D_T(e, q) = \frac{|T|}{3} \mathbf{q}(e) \int_0^1 \min\{t, 1 - t\} \, dt = \frac{|T|}{12} |e|_{\mathbf{q}}^2. \tag{3.21}$$

This concludes the proof. $\diamond$
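
Formula (3.21) is easy to check numerically. The following sketch does so for one hypothetical positive definite form (coefficients `A`, `B`, `C` below) plus an arbitrary affine part; it relies on the edge-midpoint quadrature rule, which integrates quadratic polynomials exactly over a triangle. The names are illustrative only.

```python
def area(T):
    (x0, y0), (x1, y1), (x2, y2) = T
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2

def midpoint(p, q):
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)

def interp_error_l1(f, T):
    """Integral of I_T f - f over T for a convex quadratic f, computed with
    the edge-midpoint rule (exact for polynomials of degree at most 2)."""
    total = 0.0
    for i in range(3):
        p, q = T[i], T[(i + 1) % 3]
        total += (f(*p) + f(*q)) / 2 - f(*midpoint(p, q))
    return area(T) / 3 * total

def bisect(T, i):
    """Split T by joining vertex T[i] to the midpoint of the opposite edge."""
    v, p, q = T[i], T[(i + 1) % 3], T[(i + 2) % 3]
    m = midpoint(p, q)
    return (v, p, m), (v, m, q)

def opposite_edge(T, i):
    """Edge vector of T opposite to vertex T[i]."""
    p, q = T[(i + 1) % 3], T[(i + 2) % 3]
    return (q[0] - p[0], q[1] - p[1])

def error_reduction(f, T, i):
    """D_T(e, f) for the edge e opposite to vertex T[i]."""
    return interp_error_l1(f, T) - sum(interp_error_l1(f, c) for c in bisect(T, i))

A, B, C = 2.0, 0.5, 3.0   # hypothetical positive definite form: A*C - B*B > 0

def q(x, y):
    """Quadratic polynomial with homogeneous part q_form and an affine part."""
    return A * x * x + 2 * B * x * y + C * y * y + x - 2 * y + 1

def q_form(e):
    return A * e[0] ** 2 + 2 * B * e[0] * e[1] + C * e[1] ** 2

T = ((0.0, 0.0), (2.0, 0.5), (0.4, 1.5))
reductions = [error_reduction(q, T, i) for i in range(3)]
predicted = [area(T) / 12 * q_form(opposite_edge(T, i)) for i in range(3)]
```

Under these assumptions `reductions` and `predicted` agree up to rounding, so the edge with the largest error reduction is also the longest edge in the $\mathbf{q}$-metric.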

It follows from this lemma that the longest edge of $T$ in the sense of the $\mathbf{q}$-metric is selected for bisection by the decision function. In the remainder of this section, we use this fact to prove that the refinement procedure produces triangles which tend to adopt an optimal aspect ratio in the sense that $\rho_{\mathbf{q}}(T)$ becomes small in an average sense.

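For this purpose, it is convenient to introduce a close variant of $\rho_{\mathbf{q}}(T)$: if $T$ is a triangle with edges $a, b, c$, such that $|a|_{\mathbf{q}} \ge |b|_{\mathbf{q}} \ge |c|_{\mathbf{q}}$, we define

$$\sigma_{\mathbf{q}}(T) := \frac{\mathbf{q}(b) + \mathbf{q}(c)}{4|T| \sqrt{\det \mathbf{q}}} = \frac{|b|_{\mathbf{q}}^2 + |c|_{\mathbf{q}}^2}{4|T| \sqrt{\det \mathbf{q}}}. \tag{3.22}$$

Using the inequalities $|b|_{\mathbf{q}}^2 + |c|_{\mathbf{q}}^2 \le 2|a|_{\mathbf{q}}^2$ and $|a|_{\mathbf{q}}^2 \le 2(|b|_{\mathbf{q}}^2 + |c|_{\mathbf{q}}^2)$, we obtain the equivalence

$$\frac{\rho_{\mathbf{q}}(T)}{8} \le \sigma_{\mathbf{q}}(T) \le \frac{\rho_{\mathbf{q}}(T)}{2}. \tag{3.23}$$

Similar to $\rho_{\mathbf{q}}$, this quantity is invariant under linear coordinate changes $\phi$, in the sense that

$$\sigma_{\mathbf{q} \circ \phi}(T) = \sigma_{\mathbf{q}}(\phi(T)).$$

From (2.15) and (3.23) we can relate $\sigma_{\mathbf{q}}$ to the local approximation error.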

Proposition 3.2 There exists a constant $C_0$, which depends only on the choice of $\mathcal{A}_T$, such that for any triangle $T$, quadratic function $q$ and exponent $1 \le p \le \infty$, the local $L^p$-approximation error satisfies

$$C_0^{-1} e_T(q)_p \le \sigma_{\mathbf{q}}(T) \left\|\sqrt{\det \mathbf{q}}\right\|_{L^\tau(T)} \le C_0 \, e_T(q)_p, \tag{3.24}$$

where $\frac{1}{\tau} := \frac{1}{p} + 1$.

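Our next result shows that $\sigma_{\mathbf{q}}(T)$ is always reduced by the refinement procedure.

Proposition 3.3 If $T$ is a triangle with children $T_1$ and $T_2$ obtained by the refinement procedure for the quadratic function $q$, then

$$\max\{\sigma_{\mathbf{q}}(T_1), \sigma_{\mathbf{q}}(T_2)\} \le \sigma_{\mathbf{q}}(T).$$

Proof: Assuming that $|a|_{\mathbf{q}} \ge |b|_{\mathbf{q}} \ge |c|_{\mathbf{q}}$, we know that the edge $a$ is cut and that the children have area $|T|/2$ and edges $a/2, b, (c - b)/2$ and $a/2, (b - c)/2, c$ (recall that $a + b + c = 0$). Since $\sigma_{\mathbf{q}}(T_1)$ involves the two shortest edges of $T_1$, we then have

$$2|T| \sqrt{\det \mathbf{q}} \, \sigma_{\mathbf{q}}(T_1) \le \mathbf{q}\left(\frac{a}{2}\right) + \mathbf{q}\left(\frac{c - b}{2}\right) \tag{3.25}$$

$$= \mathbf{q}\left(\frac{b + c}{2}\right) + \mathbf{q}\left(\frac{c - b}{2}\right) \tag{3.26}$$

$$= \frac{\mathbf{q}(b) + \mathbf{q}(c)}{2} \tag{3.27}$$

$$= 2|T| \sqrt{\det \mathbf{q}} \, \sigma_{\mathbf{q}}(T). \tag{3.28}$$

The same computation applies to $T_2$. $\diamond$
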
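
The monotonicity of $\sigma_{\mathbf{q}}$ under bisection of the longest edge in the $\mathbf{q}$-metric can also be observed numerically. The sketch below samples random triangles in the unit square and checks the inequality of Proposition 3.3 on each of them; the sampling scheme, the tolerance and the function names are choices made for this example, not part of the paper.

```python
import math
import random

def form(Q, u):
    """Evaluate q(u) = <Q u, u> for the symmetric matrix Q = ((a, b), (b, c))."""
    (a, b), (_, c) = Q
    return a * u[0] ** 2 + 2 * b * u[0] * u[1] + c * u[1] ** 2

def area(T):
    (x0, y0), (x1, y1), (x2, y2) = T
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2

def opposite_edge(T, i):
    p, q = T[(i + 1) % 3], T[(i + 2) % 3]
    return (q[0] - p[0], q[1] - p[1])

def sigma(Q, T):
    """sigma_q(T): sum of the two smallest squared q-lengths of the edges,
    divided by 4 |T| sqrt(det q). Assumes Q is positive definite."""
    (a, b), (_, c) = Q
    squared = sorted(form(Q, opposite_edge(T, i)) for i in range(3))
    return (squared[0] + squared[1]) / (4 * area(T) * math.sqrt(a * c - b * b))

def bisect_longest(Q, T):
    """Bisect the edge of T which is longest in the q-metric."""
    i = max(range(3), key=lambda j: form(Q, opposite_edge(T, j)))
    v, p, q = T[i], T[(i + 1) % 3], T[(i + 2) % 3]
    m = ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
    return (v, p, m), (v, m, q)

def check_monotone(Q, trials=200, seed=0):
    """Return True if sigma never increases on the sampled triangles."""
    rng = random.Random(seed)
    for _ in range(trials):
        T = tuple((rng.random(), rng.random()) for _ in range(3))
        if area(T) < 1e-3:          # skip nearly degenerate samples
            continue
        parent = sigma(Q, T)
        if any(sigma(Q, child) > parent * (1 + 1e-9)
               for child in bisect_longest(Q, T)):
            return False
    return True
```

A call such as `check_monotone(((2.0, 0.5), (0.5, 3.0)))` is expected to return `True`, in agreement with the proposition.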
Remark 3.4 When $\mathbf{q}$ is the euclidean metric, the triangle that minimizes $\sigma_{\mathbf{q}}$ is the half square. This is consistent with the above result since it is the only triangle which is similar (i.e. identical up to a translation, a rotation and a dilation) to both of its children after one step of longest edge bisection.

Remark 3.5 A result of similar nature was already proved in [12]: longest edge bisection has the effect that the minimal angle in any triangle after an arbitrary number of refinements is at least half the minimal angle of the initial triangle.

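Our next objective is to show that as we iterate the refinement process, the value of $\sigma_{\mathbf{q}}(T)$ becomes bounded independently of $\mathbf{q}$ for almost all generated triangles. For this purpose we introduce the following notation: if $T$ is a triangle with edges such that $|a|_{\mathbf{q}} \ge |b|_{\mathbf{q}} \ge |c|_{\mathbf{q}}$, we denote by $\psi_{\mathbf{q}}(T)$ the subtriangle of $T$ obtained after bisection of $a$ which contains the smallest edge $c$. We first establish inequalities between the measures $\sigma_{\mathbf{q}}$ and $\rho_{\mathbf{q}}$ applied to $T$ and $\psi_{\mathbf{q}}(T)$.

Proposition 3.6 Let $T$ be a triangle, then

$$\sigma_{\mathbf{q}}(\psi_{\mathbf{q}}(T)) \le \frac{5}{8} \rho_{\mathbf{q}}(T), \tag{3.29}$$

$$\rho_{\mathbf{q}}(\psi_{\mathbf{q}}(T)) \le \frac{\rho_{\mathbf{q}}(T)}{2} \left( 1 + \frac{16}{\rho_{\mathbf{q}}(T)^2} \right). \tag{3.30}$$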



Proof: We first prove (3.29). Obviously, i^qiT) contains one edge s e {a, 6, c} from T, and one half edge 

' a b c^ 
~2' 2' 2J 



t e {f , |, f } from T. Therefore 



"-(^-(^^^ ^ 4iv;q(T)ivd^ ^ mvm = s^''^'^)- 

For the proof of (3.30), we restrict our attention to the case q = a:^ without loss of generality thanks 
to the invariance formula (2.14). Let T be a triangle with edges \a\ > \b\ > \c\. If h is the width of T in 
the direction perpendicular to a, then 

^_ 2|r| _ 2\a\ 
\a\ Pq{T)- 

The sub-triangle tpq{T) of T has edges f , c, d where d = and the angles at the ends of f are acute. 
Indeed 

(c,a/2) = 1 - lap - \c\') < and {d,a/2) = \ {\c\^ - \h\^) < 0. 
By Pythagora's theorem we thus find 



a 



2 



max{ - ,|c| ,\d\ } < 



4 V ' Am 



Dividing by the respective areas of T and 4'q{T), we obtain the announced result. o 

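Our next result shows that a significant reduction of $\sigma_{\mathbf{q}}$ occurs at least for one of the triangles obtained by three successive refinements, unless it has reached a small value of $\sigma_{\mathbf{q}}$. We use the notation $\psi_{\mathbf{q}}^2(T) := \psi_{\mathbf{q}}(\psi_{\mathbf{q}}(T))$ and $\psi_{\mathbf{q}}^3(T) := \psi_{\mathbf{q}}(\psi_{\mathbf{q}}^2(T))$.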

Proposition 3.7 Let T be a triangle such that crq(V'q(r)) > 5. Then crq(V'q(T)) < 0.69c7q(r). 
Proof: The monotonicity of CTq established in Proposition (3.3) implies that 

5 < CTq(V^(T)) < CTq(V^(T)) < CTq(Vq(T)). 

Combining this with inequality (3.29) we obtain 

8 < min{pq{ijl{T)), Pqi^T)), Pq{T)}. 
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According to inequality (3.30), if a triangle 5* obeys Pq{S) > 4, then | (^1 + pi^g^ ^ < 1 and therefore 
Pq(V'q('S')) < Pci{S)- We can apply this to S* = 4><i{T) and S = T, therefore obtaining 

Pq«(T)) < Pq(Vq(r)) < p^{T). (3.31) 

We now remark that inequality (3.30) is equivalent to {pq{S) — Pq(V'q('^)))^ ^ Pq(V'q('^))^ ~ 16, hence 

Pq(S) > pMS)) + y^^^^0^(^))^^ (3.32) 

provided that Pq{S) > Pq{ipq{S)). Applying this to S = V'q(^) and recalling that Pq{tjj'^{T)) > 8 we 
obtain 

Pq(V'q(T)) > 8 + - 16 > 14.9. 
Applying again (3.32) to 5 = T we obtain 

Pq(T) > 14.9 + V14.92 - 16 > 29.3. 

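Using (3.30), it follows that

$$\frac{\rho_{\mathbf q}(\psi_{\mathbf q}^3(T))}{\rho_{\mathbf q}(T)} \le \frac18\left(1 + \frac{16}{\rho_{\mathbf q}(\psi_{\mathbf q}^2(T))^2}\right)\left(1 + \frac{16}{\rho_{\mathbf q}(\psi_{\mathbf q}(T))^2}\right)\left(1 + \frac{16}{\rho_{\mathbf q}(T)^2}\right) \le 0.171.$$

Finally, the inequalities (3.23) imply that

$$2\,\sigma_{\mathbf q}(\psi_{\mathbf q}^3(T)) \le \rho_{\mathbf q}(\psi_{\mathbf q}^3(T)) \le 0.171\,\rho_{\mathbf q}(T) \le 0.171\,(8\,\sigma_{\mathbf q}(T)),$$

which concludes the proof. $\diamond$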
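The numerical constants in this proof can be reproduced in a few lines. The sketch below (function names are hypothetical) propagates the lower bound $\rho \ge 8$ through (3.32) twice, then multiplies the three factors coming from (3.30), read here as $\rho(\psi(S)) \le \frac{\rho(S)}{2}(1 + 16/\rho(S)^2)$.

```python
import math

def rho_lower_bound(rho_child):
    """Lower bound (3.32) on rho(S), given a lower bound on rho(psi(S))."""
    return rho_child + math.sqrt(rho_child ** 2 - 16.0)

def reduction_factor():
    """Return the two propagated lower bounds and the product of the (3.30) factors."""
    r2 = 8.0                      # lower bound on rho(psi^2(T))
    r1 = rho_lower_bound(r2)      # lower bound on rho(psi(T))
    r0 = rho_lower_bound(r1)      # lower bound on rho(T)
    factor = 1.0
    for r in (r2, r1, r0):
        # Each factor decreases with r, so lower bounds on r give upper bounds.
        factor *= 0.5 * (1.0 + 16.0 / r ** 2)
    return r1, r0, factor

r1, r0, factor = reduction_factor()
# The final constant of Proposition 3.7 is 8 * factor / 2.
sigma_ratio = 8.0 * factor / 2.0
```

Here `factor` bounds $\rho(\psi^3(T))/\rho(T)$ and `sigma_ratio` bounds $\sigma(\psi^3(T))/\sigma(T)$ via $2\sigma \le \rho \le 8\sigma$.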

An immediate consequence of Propositions 3.3 and 3.7 is the following. 

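Corollary 3.8 If $(T_i)_{i=1}^{8}$ are the eight children obtained from three successive refinement procedures from
$T$ for the function $q$, then

• for all $i$, $\sigma_{\mathbf q}(T_i) \le \sigma_{\mathbf q}(T)$,

• there exists $i$ such that $\sigma_{\mathbf q}(T_i) \le 0.69\,\sigma_{\mathbf q}(T)$ or $\sigma_{\mathbf q}(T_i) \le 5$.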

We are now ready to prove that most triangles tend to adopt an optimal aspect ratio as one iterates the
refinement procedure.

Theorem 3.9 Let $T$ be a triangle, and $\mathbf q$ a positive definite quadratic function. Let $k = \frac{\ln \sigma_{\mathbf q}(T) - \ln 5}{-\ln(0.69)}$.
Then after $n$ applications of the refinement procedure starting from $T$, at most $C n^k 7^{n/3}$ of the $2^n$ generated
triangles $S$ satisfy $\sigma_{\mathbf q}(S) \ge 5$, where $C$ is an absolute constant. Therefore the proportion of such triangles
tends exponentially fast to $0$ as $n \to +\infty$.
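The behaviour described by Theorem 3.9 can be observed experimentally. The sketch below is an illustration only: it uses the Euclidean metric, refines every triangle by longest edge bisection, and assumes $\sigma(T) = (|b|^2 + |c|^2)/(4|T|)$ with $b, c$ the two shortest edges (this definition is not restated here). The function names and the thin starting triangle are hypothetical choices.

```python
def squared_edges(tri):
    """Squared Euclidean lengths of the edges v0v1, v1v2, v2v0."""
    return [(tri[(i + 1) % 3][0] - tri[i][0]) ** 2 +
            (tri[(i + 1) % 3][1] - tri[i][1]) ** 2 for i in range(3)]

def area(tri):
    (x0, y0), (x1, y1), (x2, y2) = tri
    return abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)) / 2.0

def sigma(tri):
    """Assumed non-degeneracy measure: two smallest squared edges over 4|T|."""
    s = sorted(squared_edges(tri))
    return (s[0] + s[1]) / (4.0 * area(tri))

def refine(tri):
    """Split a triangle into two children by bisecting its longest edge."""
    sq = squared_edges(tri)
    i = max(range(3), key=lambda j: sq[j])
    p, q, r = tri[i], tri[(i + 1) % 3], tri[(i + 2) % 3]
    mid = ((p[0] + q[0]) / 2.0, (p[1] + q[1]) / 2.0)
    return (p, mid, r), (mid, q, r)

def uniform_refinement(tri, n):
    """All 2^n triangles obtained after n refinement levels."""
    level = [tri]
    for _ in range(n):
        level = [child for t in level for child in refine(t)]
    return level

def bad_fraction(tri, n, threshold=5.0):
    """Proportion of generation-n triangles whose sigma is at least threshold."""
    level = uniform_refinement(tri, n)
    return sum(1 for t in level if sigma(t) >= threshold) / len(level)

root = ((0.0, 0.0), (1.0, 0.0), (0.5, 0.01))   # thin triangle, sigma about 25
fractions = [bad_fraction(root, n) for n in range(0, 13, 3)]
```

Printing `fractions` shows how the proportion of triangles with $\sigma \ge 5$ evolves every three generations; the theorem predicts that it eventually decays geometrically, while Proposition 3.3 guarantees that no child has a larger $\sigma$ than the root.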

Proof: If we prove the result for $n$ multiple of 3, then it will hold for all $n$ (with a larger constant)
since $\sigma_{\mathbf q}$ decreases at each refinement step. We now assume that $n = 3m$, and consider the octree with
root $T$ obtained by only considering the triangles of generation $3i$ for $i = 0, \dots, m$.

According to Corollary 3.8, for each node of this tree, one of its eight children either satisfies $\sigma_{\mathbf q} \le 5$
or has its non-degeneracy measure diminished by a factor $\theta := 0.69$. We remark that if $\sigma_{\mathbf q}$ is diminished
at least $k$ times on the path going from the root $T$ to a leaf $S$, then $\sigma_{\mathbf q}(S) \le 5$. As a consequence,
the number $N(m)$ of triangles $S$ which are such that $\sigma_{\mathbf q}(S) > 5$ within the generation level $n = 3m$ is
bounded by the number of words in an eight-letter alphabet $\{a_1, \dots, a_8\}$ with length $m$ and that use
the letter $a_8$ at most $k$ times, namely

$$N(m) \le \sum_{l=0}^{k} \binom{m}{l}\, 7^{m-l} \le C\, m^k\, 7^m,$$

which is the announced result. $\diamond$
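The counting bound used at the end of this proof is easy to evaluate exactly. The following sketch (hypothetical function names) computes the number of words of length $m$ over eight letters that use the letter $a_8$ at most $k$ times, and the corresponding proportion among all $8^m$ words, which is the quantity shown to decay.

```python
from math import comb

def bad_triangle_bound(m, k):
    """Number of length-m words over 8 letters using the letter a_8 at most k times."""
    return sum(comb(m, l) * 7 ** (m - l) for l in range(min(k, m) + 1))

def bad_proportion_bound(m, k):
    """The same count divided by the total number 8^m of words."""
    return bad_triangle_bound(m, k) / 8 ** m

# Example: with k = 3 the proportion shrinks as the depth m grows.
proportions = [bad_proportion_bound(m, 3) for m in (10, 50, 100, 200)]
```

The proportion is the probability that a binomial variable with parameters $(m, 1/8)$ is at most $k$, which makes the exponential decay for fixed $k$ transparent.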



The fact that most triangles tend to adopt an optimal aspect ratio as one iterates the refinement 
procedure is a first hint that the approximation error in the greedy algorithm might satisfy the estimate 
(1.1) corresponding to an optimal triangulation. The following result shows that this is indeed the 
case, when this algorithm is applied on a triangular domain to a quadratic function q with positive 
definite associated quadratic form q. The extension of this result to more general convex functions 
on polygonal domains requires a more involved analysis based on local perturbation arguments and is 
the object of the next section. 

Corollary 3.10 Let $\Omega$ be a triangle, and let $q$ be a quadratic function with positive definite associated
quadratic form $\mathbf q$. Let $q_N$ be the approximant of $q$ on $\Omega$ obtained by the greedy algorithm for the $L^p$
metric, using the $L^1$ decision function (2.18). Then

$$\limsup_{N\to\infty} N\,\|q - q_N\|_{L^p(\Omega)} \le C\,\|\sqrt{\det(\mathbf q)}\|_{L^\tau(\Omega)},$$

where $\frac1\tau = \frac1p + 1$ and where the constant $C$ depends only on the choice of the approximation operator
$A_T$ used in the definition of the approximant.

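Proof: For any triangle $T$, quadratic function $q \in \mathbb P_2$, and exponent $p$, let

$$e'_T(q)_p := \inf_{\pi \in \mathbb P_1} \|q - \pi\|_{L^p(T)}$$

be the error of best approximation of $q$ on $T$. Let $T_0$ be a fixed triangle of area 1, then for any $q \in \mathbb P_2$
and $1 \le p \le \infty$ one has

$$e'_{T_0}(q)_1 \le e'_{T_0}(q)_p \le e_{T_0}(q)_p \le e_{T_0}(q)_\infty.$$

Furthermore, $e'_{T_0}(\cdot)_1$ and $e_{T_0}(\cdot)_\infty$ are semi-norms on the finite dimensional space $\mathbb P_2$ which vanish precisely
on the same subspace of $\mathbb P_2$, namely $\mathbb P_1$. Hence these semi-norms are equivalent. It follows that

$$c_0\, e_{T_0}(q)_p \le e'_{T_0}(q)_p \le e_{T_0}(q)_p \qquad (3.33)$$

where $c_0$ is independent of $q \in \mathbb P_2$ and of $p \ge 1$. Using the invariance property (2.6) we find that (3.33)
holds for any triangle $T$ in place of $T_0$ with the same constant $c_0$. We also define for any triangulation $\mathcal T$,

$$e_{\mathcal T}(f)_p^p := \sum_{T \in \mathcal T} e_T(f)_p^p \quad\text{and}\quad e'_{\mathcal T}(f)_p^p := \sum_{T \in \mathcal T} e'_T(f)_p^p,$$

and we remark that $c_0\, e_{\mathcal T}(q)_p \le e'_{\mathcal T}(q)_p \le e_{\mathcal T}(q)_p$. For each $n$, we denote by $\mathcal T_n^u$ the triangulation of $\Omega$
produced by $n$ successive refinements based on the $L^1$ decision function (2.18) for the quadratic function
$q$ of interest (note that $\#(\mathcal T_n^u) = 2^n$). We also define $\mathcal T_n^b := \{T \in \mathcal T_n^u \,;\, \sigma_{\mathbf q}(T) > 5\}$. Therefore $\sigma_{\mathbf q}(T) \le 5$
if $T \in \mathcal T_n^u \setminus \mathcal T_n^b$, and on the other hand we know from Proposition 3.3 that $\sigma_{\mathbf q}(T) \le \sigma_{\mathbf q}(\Omega)$ for any $T \in \mathcal T_n^u$. It
follows from Proposition 3.2 that

$$e_{\mathcal T_n^u}(q)_p \le C_0 \Big(\sum_{T \in \mathcal T_n^u} \big(\sigma_{\mathbf q}(T)\,|T|^{\frac1\tau}\sqrt{\det \mathbf q}\big)^p\Big)^{\frac1p}
\le C_0 \big(5^p \times 2^n + \sigma_{\mathbf q}(\Omega)^p\, \#(\mathcal T_n^b)\big)^{\frac1p} \Big(\frac{|\Omega|}{2^n}\Big)^{\frac1\tau} \sqrt{\det \mathbf q},$$

where $C_0$ is the constant in (3.24). According to Theorem 3.9, we know that

$$\lim_{n \to +\infty} 2^{-n}\, \#(\mathcal T_n^b) = 0.$$

Hence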

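$$\limsup_{n \to +\infty} 2^n e_{\mathcal T_n^u}(q)_p \le 5 C_0\, |\Omega|^{\frac1\tau} \sqrt{\det \mathbf q} = 5 C_0\, \|\sqrt{\det \mathbf q}\|_{L^\tau(\Omega)}.$$

We now denote by $\mathcal T_n^g$ the triangulation generated by the greedy procedure with stopping criterion based
on the error $\eta_n := C_0^{-1}\, 2^{-\frac n\tau}\, \|\sqrt{\det \mathbf q}\|_{L^\tau(\Omega)}$. It follows from (3.24) that for all $T \in \mathcal T_k^u$ with $k < n$, one has

$$e_T(q)_p \ge C_0^{-1}\, \sigma_{\mathbf q}(T)\, |T|^{\frac1\tau} \sqrt{\det \mathbf q} \ge C_0^{-1}\, 2^{-\frac k\tau}\, \|\sqrt{\det \mathbf q}\|_{L^\tau(\Omega)} > \eta_n,$$

where we used that $|T| = 2^{-k}|\Omega|$ and that the minimal value of $\sigma_{\mathbf q}$ is 1. This shows that $\mathcal T_n^g$ is a refinement
of $\mathcal T_n^u$. Furthermore any triangle $T \in \mathcal T_n^u$ has at most $2^{k(T)}$ children in $\mathcal T_n^g$, where $k(T)$ is the smallest
integer such that

$$\eta_n \ge C_0\, 2^{-\frac{n + k(T)}{\tau}}\, \sigma_{\mathbf q}(T)\, \|\sqrt{\det \mathbf q}\|_{L^\tau(\Omega)}.$$

Since $\frac12 \le \tau \le 1$ we obtain $2^{k(T)} \le 2^{\frac{k(T)}{\tau}} \le 2^{\frac1\tau} C_0^2\, \sigma_{\mathbf q}(T) \le 4 C_0^2\, \sigma_{\mathbf q}(T)$. Hence

$$\#(\mathcal T_n^g) \le \sum_{T \in \mathcal T_n^u} 2^{k(T)} \le \sum_{T \in \mathcal T_n^u} 4 C_0^2\, \sigma_{\mathbf q}(T) \le 4 C_0^2 \big(5 \times 2^n + \sigma_{\mathbf q}(\Omega)\, \#(\mathcal T_n^b)\big) = C_1 2^n (1 + \varepsilon_n),$$

where $C_1 = 20\, C_0^2$ and $\varepsilon_n \to 0$ as $n \to \infty$. If $\mathcal T_N$ is the triangulation generated after $N$ steps of the greedy
algorithm, then there exists $n \ge 0$ such that $\mathcal T_N$ is a refinement of $\mathcal T_n^g$ (hence a refinement of $\mathcal T_n^u$) and
$\mathcal T_{n+1}^g$ is a refinement of $\mathcal T_N$. It follows that $\#(\mathcal T_N) \le \#(\mathcal T_{n+1}^g) \le C_1 2^{n+1}(1 + \varepsilon_{n+1})$, and

$$c_0\, e_{\mathcal T_N}(q)_p \le e'_{\mathcal T_N}(q)_p \le e'_{\mathcal T_n^u}(q)_p \le e_{\mathcal T_n^u}(q)_p,$$

where we have used the fact that $e'_{\mathcal T'}(f)_p \le e'_{\mathcal T}(f)_p$ whenever $\mathcal T'$ is a refinement of $\mathcal T$. Finally,

$$\limsup_{N \to \infty} N\, e_{\mathcal T_N}(q)_p \le \limsup_{n \to \infty} \frac{C_1}{c_0}\, 2^{n+1} (1 + \varepsilon_{n+1})\, e_{\mathcal T_n^u}(q)_p \le \frac{10\, C_0 C_1}{c_0}\, \|\sqrt{\det \mathbf q}\|_{L^\tau(\Omega)},$$

which concludes the proof. $\diamond$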



4 The case of strictly convex functions 

The goal of this section is to prove that the approximation error in the greedy algorithm applied to a 
function / satisfies the estimate (1.1) corresponding to an optimal triangulation. Our main result is so 
far limited to the case where / is strictly convex. 

Theorem 4.1 Let $f \in C^2(\Omega)$ be such that

$$d^2 f(x) \ge m I, \quad \text{for all } x \in \Omega$$

for some arbitrary but fixed $m > 0$ independent of $x$. Let $f_N$ be the approximant obtained by the greedy
algorithm for the $L^p$ metric, using the $L^1$ decision function (2.18). Then

$$\limsup_{N \to +\infty} N\, \|f - f_N\|_{L^p} \le C\, \|\sqrt{\det(d^2 f)}\|_{L^\tau}, \qquad (4.34)$$

where $\frac1\tau = \frac1p + 1$ and where $C$ is an absolute constant (i.e. independent of $p$, $f$ and $m$).

Equation (4.34) can be rephrased as follows: there exists a sequence $\varepsilon_N(f)$ such that $\varepsilon_N(f) \to 0$ as
$N \to \infty$ and

$$\|f - f_N\|_{L^p} \le \big(C\, \|\sqrt{\det(d^2 f)}\|_{L^\tau} + \varepsilon_N(f)\big)\, N^{-1}.$$

Note also that since $\|\sqrt{\det(d^2 f)}\|_{L^\tau} > 0$, there exists $N_0(f)$ such that $\|f - f_N\|_{L^p} \le 2C\, \|\sqrt{\det(d^2 f)}\|_{L^\tau}\, N^{-1}$
for all $N \ge N_0(f)$. It should be stressed that $N_0(f)$ can be arbitrarily large depending on the function
$f$. Intuitively, this means that when $f$ has a very large Hessian at certain points, it takes more iterations
for the algorithm to generate triangles with a good aspect ratio. The extension of this result to strictly
concave functions is immediate by a change of sign. Its extension to arbitrary functions is so far
incomplete, as explained at the end of the introduction. The proof of Theorem 4.1 uses the fact that
a strictly convex function is locally close to a quadratic function with positive definite Hessian, which
allows us to exploit the results obtained in §3 for these particular functions.

4.1 A perturbation result

We consider a triangle $T$, a function $f \in C^2(T)$, a convex quadratic function $q$ and $\mu > 0$ such that on $T$

$$d^2 q \le d^2 f \le (1 + \mu)\, d^2 q. \qquad (4.35)$$

It follows that $\det(d^2 q) \le \det(d^2 f) \le \det((1 + \mu) d^2 q) = (1 + \mu)^2 \det(d^2 q)$. Since $\det(d^2 q) = 4 \det(\mathbf q)$, we
obtain

$$2\, \|\sqrt{\det \mathbf q}\|_{L^\tau(T)} \le \|\sqrt{\det(d^2 f)}\|_{L^\tau(T)} \le 2 (1 + \mu)\, \|\sqrt{\det \mathbf q}\|_{L^\tau(T)}. \qquad (4.36)$$

The following proposition shows that the local errors associated to $f$ and $q$ are close.

Proposition 4.2 There exists a constant $C_e > 0$, depending only on the operator $A_T$, such that

$$(1 - C_e \mu)\, e_T(q)_p \le e_T(f)_p \le (1 + C_e \mu)\, e_T(q)_p. \qquad (4.37)$$

Proof: It follows from inequality (4.35) that the functions $f - q$ and $(1 + \mu) q - f$ are convex, hence

$$I_T(f - q) - (f - q) \ge 0 \quad\text{and}\quad I_T((1 + \mu) q - f) - ((1 + \mu) q - f) \ge 0$$

on the triangle $T$. We therefore obtain

$$0 \le (I_T f - f) - (I_T q - q) \le \mu\, (I_T q - q).$$

There exists a constant $C_0 > 0$ depending only on $A_T$ such that for any $h \in C^0(T)$,

$$e_T(h)_p \le C_0\, |T|^{\frac1p}\, \|h\|_{L^\infty(T)}.$$

Furthermore according to Proposition 2.3 there exists a constant $C_1 > 0$ depending only on $A_T$ such that

$$|T|^{\frac1p}\, \|q - I_T q\|_{L^\infty(T)} \le C_1\, e_T(q)_p.$$

Hence

$$\begin{aligned}
|e_T(f)_p - e_T(q)_p| &\le e_T(f - q)_p \\
&= e_T\big((I_T f - f) - (I_T q - q)\big)_p \\
&\le C_0\, |T|^{\frac1p}\, \|(I_T f - f) - (I_T q - q)\|_{L^\infty(T)} \\
&\le C_0\, \mu\, |T|^{\frac1p}\, \|I_T q - q\|_{L^\infty(T)} \\
&\le C_0 C_1\, \mu\, e_T(q)_p.
\end{aligned}$$

This concludes the proof of the proposition, with $C_e = C_0 C_1$. $\diamond$
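Note that using Proposition 3.2, and assuming that $\mu \le c_e$ for a fixed constant $c_e < 1/C_e$ (so that the lower
bound in (4.37) is non-trivial), we have with $\frac1\tau := 1 + \frac1p$,

$$e_T(f)_p \sim e_T(q)_p \sim \sigma_{\mathbf q}(T)\, \|\sqrt{|\det \mathbf q|}\|_{L^\tau(T)} \sim \sigma_{\mathbf q}(T)\, \|\sqrt{\det(d^2 f)}\|_{L^\tau(T)}, \qquad (4.38)$$

with absolute constants in the equivalence.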

We next study the behavior of the decision function $e \mapsto d_T(e, f)$. For this purpose, we introduce the
following definition.

Definition 4.3 Let $T$ be a triangle with edges $a, b, c$. A $\delta$-near longest edge bisection with respect to the
$\mathbf q$-metric is a bisection of any edge $e \in \{a, b, c\}$ such that

$$\mathbf q(e) \ge (1 - \delta) \max\{\mathbf q(a), \mathbf q(b), \mathbf q(c)\}.$$

Proposition 4.4 Assume that $f$ and $q$ satisfy (4.35). Then, the bisection of $T$ prescribed by the decision
function $e \mapsto d_T(e, f)$ is a $\mu$-near longest edge bisection for the $\mathbf q$-metric.

Proof: It follows directly from Equation (2.20) that for any edge $e$ of $T$,

$$D_T(e, q) \le D_T(e, f) \le D_T(e, (1 + \mu) q),$$

hence we obtain using (3.21)

$$\tfrac12\, \mathbf q(e) \le D_T(e, f) \le (1 + \mu)\, \tfrac12\, \mathbf q(e). \qquad (4.39)$$

Therefore the bisection of $T$ prescribed by the decision function $e \mapsto d_T(e, f)$ selects an $e$ such that

$$(1 + \mu)\, \mathbf q(e) \ge \max\{\mathbf q(a), \mathbf q(b), \mathbf q(c)\}.$$

It is therefore a $\delta$-near longest edge bisection for the $\mathbf q$-metric with $\delta = \frac{\mu}{1 + \mu} \le \mu$ and therefore also a
$\mu$-near longest edge bisection. $\diamond$
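Definition 4.3 translates into a one-line test. The sketch below (hypothetical function names) lists the edges whose bisection qualifies as $\delta$-near longest, given the three values $\mathbf q(a), \mathbf q(b), \mathbf q(c)$, and computes the tolerance $\delta = \mu/(1+\mu)$ appearing in the proof of Proposition 4.4.

```python
def near_longest_edges(q_values, delta):
    """Indices of the edges whose bisection is a delta-near longest edge bisection."""
    threshold = (1.0 - delta) * max(q_values)
    return [i for i, value in enumerate(q_values) if value >= threshold]

def delta_from_mu(mu):
    """Tolerance delta = mu / (1 + mu) obtained in the proof of Proposition 4.4."""
    return mu / (1.0 + mu)

# Example with q(a) = 4.0, q(b) = 3.9, q(c) = 1.0.
exact = near_longest_edges([4.0, 3.9, 1.0], 0.0)     # only the longest edge
relaxed = near_longest_edges([4.0, 3.9, 1.0], 0.05)  # the two longest edges
```

With $\delta = 0$ the definition reduces to the usual longest edge bisection; increasing $\delta$ only enlarges the set of admissible edges.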

In the rest of this section, we analyze the difference between a longest edge bisection in the $\mathbf q$-metric
and a $\delta$-near longest edge bisection. For that purpose we introduce a distance between triangles: if $T_1, T_2$
are two triangles with edges $a_1, b_1, c_1$ and $a_2, b_2, c_2$ such that

$$\mathbf q(a_1) \ge \mathbf q(b_1) \ge \mathbf q(c_1) \quad\text{and}\quad \mathbf q(a_2) \ge \mathbf q(b_2) \ge \mathbf q(c_2), \qquad (4.40)$$

we define

$$\Delta_{\mathbf q}(T_1, T_2) = \max\{|\mathbf q(a_1) - \mathbf q(a_2)|,\ |\mathbf q(b_1) - \mathbf q(b_2)|,\ |\mathbf q(c_1) - \mathbf q(c_2)|\}.$$

Note that $\Delta_{\mathbf q}$ is a distance up to rigid transformations.

Lemma 4.5 Let $T_1, T_2$ be two triangles, let $(R_1, U_1)$ and $(R_2, U_2)$ be the two pairs of children from the
longest edge bisection of $T_1$ in the $\mathbf q$-metric, and a $\delta$-near longest edge bisection of $T_2$ in the $\mathbf q$-metric.
Then, up to a permutation of the pair of triangles $(R_1, U_1)$,

$$\max\{\Delta_{\mathbf q}(R_1, R_2),\ \Delta_{\mathbf q}(U_1, U_2)\} \le \tfrac54\, \Delta_{\mathbf q}(T_1, T_2) + \delta\, \mathbf q(a_2),$$

where $a_2$ is the longest edge of $T_2$ in the $\mathbf q$-metric.

Proof: We assume that the edges of $T_1$ and $T_2$ are named and ordered as in (4.40). Up to a permutation,
$R_1$ and $U_1$ have edge vectors $b_1, a_1/2, (c_1 - b_1)/2$ and $c_1, a_1/2, (b_1 - c_1)/2$. Two situations might occur
for the pair $(R_2, U_2)$.

• $\mathbf q(e) < (1 - \delta)\, \mathbf q(a_2)$ for $e = b_2$ and $c_2$. In such a case the triangle $T_2$ is bisected towards $a_2$, so that
up to a permutation, $R_2$ and $U_2$ have edge vectors $b_2, a_2/2, (c_2 - b_2)/2$ and $c_2, a_2/2, (b_2 - c_2)/2$.
Using that $\mathbf q((c - b)/2) = \mathbf q(c)/2 + \mathbf q(b)/2 - \mathbf q(a)/4$ when $a + b + c = 0$, it clearly follows that

$$\max\{\Delta_{\mathbf q}(R_1, R_2),\ \Delta_{\mathbf q}(U_1, U_2)\} \le \tfrac54\, \Delta_{\mathbf q}(T_1, T_2).$$

• $\mathbf q(e) \ge (1 - \delta)\, \mathbf q(a_2)$ for some $e = b_2$ or $c_2$. In such a case $T_2$ may be bisected say towards $b_2$, so that
up to a permutation, $R_2$ and $U_2$ have edge vectors $a_2, b_2/2, (c_2 - a_2)/2$ and $c_2, b_2/2, (a_2 - c_2)/2$.
But since $|\mathbf q(b_2) - \mathbf q(a_2)| \le \delta\, \mathbf q(a_2)$, we obtain that

$$\max\{\Delta_{\mathbf q}(R_1, R_2),\ \Delta_{\mathbf q}(U_1, U_2)\} \le \tfrac54\, \Delta_{\mathbf q}(T_1, T_2) + \delta\, \mathbf q(a_2). \qquad (4.41)$$

$\diamond$
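The median identity used in the first case of this proof can be confirmed numerically. The sketch below is only a check under illustrative assumptions: the quadratic form coefficients are arbitrary, and the function names are hypothetical. Since the identity expresses $\mathbf q((c-b)/2)$ with coefficients $\frac12, \frac12, -\frac14$, a perturbation of at most $\Delta$ on each of $\mathbf q(a), \mathbf q(b), \mathbf q(c)$ changes it by at most $\frac54 \Delta$, which is where the factor $\frac54$ comes from.

```python
import random

def q_form(v, coeffs=(2.0, 0.5, 1.0)):
    """Quadratic form A x^2 + 2 B x y + C y^2 (coefficients are illustrative)."""
    big_a, big_b, big_c = coeffs
    x, y = v
    return big_a * x * x + 2.0 * big_b * x * y + big_c * y * y

def median_identity_gap(b, c, coeffs=(2.0, 0.5, 1.0)):
    """|q((c-b)/2) - (q(c)/2 + q(b)/2 - q(a)/4)| for the closing edge a = -(b + c)."""
    a = (-(b[0] + c[0]), -(b[1] + c[1]))
    half_diff = ((c[0] - b[0]) / 2.0, (c[1] - b[1]) / 2.0)
    lhs = q_form(half_diff, coeffs)
    rhs = q_form(c, coeffs) / 2.0 + q_form(b, coeffs) / 2.0 - q_form(a, coeffs) / 4.0
    return abs(lhs - rhs)

def max_gap(trials=1000, seed=0):
    """Largest deviation from the identity over random edge vectors."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        b = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        c = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        worst = max(worst, median_identity_gap(b, c))
    return worst
```

The gap returned by `max_gap()` is at the level of floating-point rounding, as expected for an exact algebraic identity.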

We now introduce a perturbed version of the estimates describing the decay of the non-degeneracy
measure which were obtained in Proposition 3.3 and Corollary 3.8.

Proposition 4.6 If $(T_1, T_2)$ are the two children obtained from a refinement of a triangle $T$ in which a
$\delta$-near longest edge bisection in the $\mathbf q$-metric is selected, then

$$\max\{\sigma_{\mathbf q}(T_1),\ \sigma_{\mathbf q}(T_2)\} \le (1 + 4\delta)\, \sigma_{\mathbf q}(T). \qquad (4.42)$$

If $(T_i)_{i=1}^{8}$ are the eight children of a triangle $T$ obtained from three successive refinements in which a
$\delta$-near longest edge bisection in the $\mathbf q$-metric is selected, then

• for all $i$, $\sigma_{\mathbf q}(T_i) \le \sigma_{\mathbf q}(T)(1 + C_2 \delta)$,

• there exists $i$ such that $\sigma_{\mathbf q}(T_i) \le 0.69\, \sigma_{\mathbf q}(T)(1 + C_2 \delta)$ or $\sigma_{\mathbf q}(T_i) \le M$,

where $C_2 = 61$ and $M = 5(1 + C_2 \delta)$.

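Proof: We first prove (4.42), and for that purpose we introduce the two children $T_1', T_2'$ obtained by
bisecting the longest edge of $T$ in the $\mathbf q$-metric. It follows from (4.41) that, up to a permutation of the
pair $(T_1', T_2')$,

$$\max\{\Delta_{\mathbf q}(T_1, T_1'),\ \Delta_{\mathbf q}(T_2, T_2')\} \le \delta\, \mathbf q(a),$$

where $a$ is the longest edge of $T$ in the $\mathbf q$-metric. Hence

$$|\sigma_{\mathbf q}(T_i) - \sigma_{\mathbf q}(T_i')| \le \frac{2\, \Delta_{\mathbf q}(T_i, T_i')}{4\, |T_i| \sqrt{\det(\mathbf q)}} \le \frac{2\, \delta\, \mathbf q(a)}{4\, |T_i| \sqrt{\det(\mathbf q)}} \le 4\, \delta\, \sigma_{\mathbf q}(T). \qquad (4.43)$$

We know from Proposition 3.3 that $\max\{\sigma_{\mathbf q}(T_1'), \sigma_{\mathbf q}(T_2')\} \le \sigma_{\mathbf q}(T)$. Combining this point with (4.43) we
conclude the proof of (4.42).

We now turn to the proof of the second part of the proposition and for that purpose we introduce the
eight children $(T_i')_{i=1}^{8}$ obtained from three successive refinements of $T$ in which the longest edge in the
$\mathbf q$-metric is selected. Iterating (4.41), we find that, up to a permutation of the triangles $(T_i')_{i=1}^{8}$, one has

$$\max_{i = 1, \dots, 8} \Delta_{\mathbf q}(T_i, T_i') \le \Big(1 + \frac54 + \Big(\frac54\Big)^2\Big)\, \delta\, \mathbf q(a) = \frac{61}{16}\, \delta\, \mathbf q(a) = \frac{C_2\, \delta}{16}\, \mathbf q(a),$$

where, again, $a$ is the longest edge of $T$ in the $\mathbf q$-metric. Repeating the argument (4.43) we find that

$$\max_{i = 1, \dots, 8} |\sigma_{\mathbf q}(T_i) - \sigma_{\mathbf q}(T_i')| \le C_2\, \delta\, \sigma_{\mathbf q}(T). \qquad (4.44)$$

We know from Corollary 3.8 that $\sigma_{\mathbf q}(T_i') \le \sigma_{\mathbf q}(T)$ for all $i$ and that there exists $i$ such that either
$\sigma_{\mathbf q}(T_i') \le 0.69\, \sigma_{\mathbf q}(T)$ or $\sigma_{\mathbf q}(T_i') \le 5$. Combining this point with (4.44) we conclude the proof of the
proposition. $\diamond$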



4.2 Local optimality 

Our next step towards the proof of Theorem 4.1 is to show that the triangulation produced by the
greedy algorithm is locally optimal in the following sense: if the refinement procedure for the function $f$
produces a triangle $T \in \mathcal T_N$ on which $f$ is close enough to a quadratic function $q$, then the triangles which
are generated from the refinement of $T$ tend to adopt an optimal aspect ratio in the $\mathbf q$-metric, and a local
version of the optimal estimate (1.1) holds on $T$.

We first prove that most triangles adopt an optimal aspect ratio as we iterate the refinement procedure.
Our goal is thus to obtain a result similar to Theorem 3.9 which was restricted to quadratic functions.
However, due to the perturbations by $C_2 \mu$ that appear in Proposition 4.6, the formulation will be slightly
different, yet sufficient for our purposes: we shall prove that the measure of non-degeneracy becomes
bounded by an absolute constant in an average sense, as we iterate the refinement procedure.

As in the previous section, we assume that $f$ and $q$ satisfy (4.35). For any $T$, we define $\mathcal T_n^u(T)$ the
triangulation of $T$ which is built by iteratively applying the refinement procedure for the function $f$ to
all generated triangles up to $3n$ generation levels. Note that

$$\#(\mathcal T_n^u(T)) = 2^{3n} \quad\text{and}\quad |T'| = 2^{-3n} |T|, \quad T' \in \mathcal T_n^u(T).$$

For $r > 0$, we define the average $r$-th power of the measure of non-degeneracy of the $2^{3n}$ triangles obtained
from $T$ after $3n$ iterations by

$$\bar\sigma_{\mathbf q}^r(n) := \frac{1}{2^{3n}} \sum_{T' \in \mathcal T_n^u(T)} \sigma_{\mathbf q}(T')^r.$$

We also define

$$\gamma(r, \mu) := \frac18 \big(0.69\, (1 + C_2 \mu)\big)^r + \frac78\, (1 + C_2 \mu)^r,$$

where $C_2$ is the constant in Proposition 4.6. Note that for any $r > 0$, the function $\gamma(r, \cdot)$ is continuous
and increasing, and that $0 < \gamma(r, 0) < 1$. Hence for any $r > 0$, there exists $\mu(r) > 0$ and $0 < \gamma(r) < 1$
such that $\gamma(r, \mu) \le \gamma(r)$, if $0 < \mu \le \mu(r)$.

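Proposition 4.7 Assume that $f$ and $q$ satisfy (4.35) with $0 < \mu \le \mu(r)$. We then have

$$\bar\sigma_{\mathbf q}^r(n) \le \sigma_{\mathbf q}(T)^r\, \gamma(r)^n + \frac{M^r}{8\, (1 - \gamma(r))},$$

where $M$ is the constant in Proposition 4.6. Therefore

$$\bar\sigma_{\mathbf q}^r(n) \le C_3 := 1 + \frac{M^r}{8\, (1 - \gamma(r))}, \quad\text{if } 2^{3n} \ge 8\, \sigma_{\mathbf q}(T)^\lambda \text{ with } \lambda := \frac{r \ln 8}{-\ln \gamma(r)}.$$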



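Proof: Let us use the notations $u = 0.69\, (1 + C_2 \mu)$ and $v = (1 + C_2 \mu)$. According to Proposition 4.6, we
have

$$\bar\sigma_{\mathbf q}^r(n) \le E(\sigma_n^r),$$

where $E$ is the expectation operator and $\sigma_n$ is the Markov chain with values in $[1, +\infty[$ defined by

• $\sigma_{n+1} = \max\{\sigma_n u, M\}$ with probability $\alpha := \frac18$,

• $\sigma_{n+1} = \sigma_n v$ with probability $\beta := \frac78$,

• $\sigma_0 := \sigma_{\mathbf q}(T)$ with probability 1.

Denoting by $\mu_n$ the probability distribution of $\sigma_n$, we have

$$\begin{aligned}
E(\sigma_{n+1}^r) &= \int_1^\infty \sigma^r\, d\mu_{n+1}(\sigma) \\
&= \int_1^\infty \big(\alpha\, (\max\{u \sigma, M\})^r + \beta\, (v \sigma)^r\big)\, d\mu_n(\sigma) \\
&= \alpha M^r \int_1^{M/u} d\mu_n(\sigma) + \alpha u^r \int_{M/u}^\infty \sigma^r\, d\mu_n(\sigma) + \beta v^r \int_1^{+\infty} \sigma^r\, d\mu_n(\sigma) \\
&\le \alpha M^r + (\alpha u^r + \beta v^r)\, E(\sigma_n^r) \\
&\le \alpha M^r + \gamma(r)\, E(\sigma_n^r).
\end{aligned}$$

By iteration, it follows that

$$E(\sigma_n^r) \le E(\sigma_0^r)\, \gamma(r)^n + \frac{\alpha M^r}{1 - \gamma(r)},$$

which gives the result.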
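The recursion in this proof can be illustrated by computing the expectation of the Markov chain exactly for small $n$ and comparing it with the closed-form bound. The sketch below is a toy check under stated assumptions: the value $C_2 \mu = 0.01$, the exponent $r = 1$ and the starting value $\sigma_0 = 100$ are illustrative choices, and all function names are hypothetical.

```python
def gamma(r, u, v, alpha=1.0 / 8.0):
    """Contraction factor alpha * u^r + (1 - alpha) * v^r."""
    return alpha * u ** r + (1.0 - alpha) * v ** r

def expected_power(sigma0, n, r, u, v, big_m, alpha=1.0 / 8.0):
    """Exact E(sigma_n^r), obtained by enumerating the 2^n branches of the chain."""
    dist = [(sigma0, 1.0)]
    for _ in range(n):
        dist = [pair for s, p in dist
                for pair in ((max(s * u, big_m), alpha * p),
                             (s * v, (1.0 - alpha) * p))]
    return sum(p * s ** r for s, p in dist)

def closed_form_bound(sigma0, n, r, u, v, big_m, alpha=1.0 / 8.0):
    """Right-hand side sigma0^r * gamma^n + alpha * M^r / (1 - gamma)."""
    g = gamma(r, u, v, alpha)
    return sigma0 ** r * g ** n + alpha * big_m ** r / (1.0 - g)

# Illustrative parameters: C_2 * mu = 0.01, r = 1, sigma_0 = 100.
perturbation = 0.01
u, v = 0.69 * (1.0 + perturbation), 1.0 + perturbation
big_m = 5.0 * (1.0 + perturbation)
pairs = [(expected_power(100.0, n, 1.0, u, v, big_m),
          closed_form_bound(100.0, n, 1.0, u, v, big_m)) for n in range(13)]
```

Each entry of `pairs` holds the exact expectation and its bound for one value of $n$; the enumeration grows like $2^n$, so it is only meant for small depths.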
Our next goal is to show that the greedy algorithm initialized from $T$ generates a triangulation which
is a refinement of $\mathcal T_n^u(T)$ and therefore more accurate, yet with a similar number of triangles. To this
end, we apply the greedy algorithm with root $T$ and stopping criterion given by the local error

$$\eta := \min_{T' \in \mathcal T_n^u(T)} e_{T'}(f)_p.$$

Therefore $T'$ is split if and only if $e_{T'}(f)_p > \eta$. We denote by $\mathcal T_N(T)$ the resulting triangulation where
$N$ is its cardinality. From the definition of the stopping criterion, it is clear that $\mathcal T_N(T)$ is a refinement
of $\mathcal T_n^u(T)$.

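Proposition 4.8 Assume that $f$ and $q$ satisfy (4.35) with $\mu \le \frac18$, and define $r_0 := \frac{\ln 2}{\ln 4 - \ln 3} > 0$. We
then have

$$N \le C_4\, 2^{3n}\, \bar\sigma_{\mathbf q}^{r_0}(n),$$

where $C_4$ is an absolute constant. Assuming in addition that $\mu \le \mu(r_0)$ as in Proposition 4.7, we obtain
that

$$N \le C_5\, 2^{3n},$$

if $2^{3n} \ge 8\, \sigma_{\mathbf q}(T)^\lambda$ with $\lambda := \frac{r_0 \ln 8}{-\ln \gamma(r_0)}$, and where $C_5 = C_3 C_4$.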

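Proof: Let $T_1$ be a triangle in $\mathcal T_n^u(T)$ and $T_2$ a triangle in $\mathcal T_N(T)$ such that $T_2 \subset T_1$. We shall give a
bound on the number of splits $k$ which were applied between $T_1$ and $T_2$, i.e. such that $|T_2| = 2^{-k} |T_1|$.
We first remark that according to Proposition 3.2 and (4.38), we have

$$\eta \ge c \min_{T' \in \mathcal T_n^u(T)} |T'|^{1 + \frac1p}\, \sigma_{\mathbf q}(T') \sqrt{\det \mathbf q} \ge c\, |T_1|^{1 + \frac1p} \sqrt{\det \mathbf q},$$

where $c$ is an absolute constant. On the other hand, using both Proposition 4.2 and Proposition 4.6, we
obtain

$$\begin{aligned}
e_{T_2}(f)_p &\le C\, |T_2|^{1 + \frac1p}\, \sigma_{\mathbf q}(T_2) \sqrt{\det \mathbf q} \\
&= C\, |T_1|^{1 + \frac1p}\, 2^{-k (1 + \frac1p)}\, \sigma_{\mathbf q}(T_2) \sqrt{\det \mathbf q} \\
&\le C\, |T_1|^{1 + \frac1p}\, \sigma_{\mathbf q}(T_1) \big(2^{-(1 + \frac1p)} (1 + 4 \mu)\big)^k \sqrt{\det \mathbf q} \\
&\le \frac{C}{c}\, \eta\, \sigma_{\mathbf q}(T_1) \Big(\frac34\Big)^k,
\end{aligned}$$

where $C$ is an absolute constant. Therefore we see that $k$ is at most the smallest integer such that
$\frac{C}{c}\, \sigma_{\mathbf q}(T_1) \big(\frac34\big)^k \le 1$. It follows that the total number $n(T_1)$ of triangles $T_2 \in \mathcal T_N(T)$ which are contained in
$T_1$ is bounded by

$$n(T_1) \le 2^k \le 2 \Big(\frac{C}{c}\, \sigma_{\mathbf q}(T_1)\Big)^{r_0},$$

and therefore

$$N = \sum_{T_1 \in \mathcal T_n^u(T)} n(T_1) \le C_4 \sum_{T_1 \in \mathcal T_n^u(T)} \sigma_{\mathbf q}(T_1)^{r_0} = C_4\, 2^{3n}\, \bar\sigma_{\mathbf q}^{r_0}(n),$$

with $C_4 = 2 \big(\frac{C}{c}\big)^{r_0}$. The fact that $N \le C_5\, 2^{3n}$ when $2^{3n} \ge 8\, \sigma_{\mathbf q}(T)^\lambda$ with $\lambda := \frac{r_0 \ln 8}{-\ln \gamma(r_0)}$ is an immediate
consequence of Proposition 4.7. $\diamond$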



4.3 Optimal convergence estimates 

Our last step towards the proof of Theorem 4.1 consists in deriving local error estimates for the greedy
algorithm. For $\eta > 0$, we denote by $f_\eta$ the approximant to $f$ obtained by the greedy algorithm with
stopping criterion given by the local error $\eta$: a triangle $T$ is split if and only if $e_T(f)_p > \eta$. The
resulting triangulation is denoted by

$$\mathcal T_\eta = \mathcal T_N, \quad\text{with } N = N(\eta) = \#(\mathcal T_\eta).$$

For this $N$, we thus have $f_\eta = f_N$. For a given $T$ generated by the refinement procedure and such that
$\eta \le e_T(f)_p$, we also define

$$\mathcal T_\eta(T) = \{T' \subset T \,;\, T' \in \mathcal T_\eta\}$$

the triangles in $\mathcal T_\eta$ which are contained in $T$ and

$$N(T, \eta) = \#(\mathcal T_\eta(T)).$$

Our next result provides estimates of the local error $\|f - f_\eta\|_{L^p(T)}$ and of $N(T, \eta)$ in terms of $\eta$,
provided that $\mu$ is small enough.

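Theorem 4.9 Assume that $f$ and $q$ satisfy (4.35) with $\mu \le c_2 := \min\{\frac18, \mu(r_0)\}$, and that $\eta \le \eta_0$, where

$$\eta_0 = \eta_0(T) := \Big(\frac{|T|}{\sigma_{\mathbf q}(T)^\lambda}\Big)^{\frac1\tau} \sqrt{\det \mathbf q},$$

with $\lambda := \frac{r_0 \ln 8}{-\ln \gamma(r_0)}$, and $\frac1\tau = \frac1p + 1$. Then

$$\|f - f_\eta\|_{L^p(T)} \le \eta\, N(T, \eta)^{\frac1p}, \qquad (4.45)$$
$$N(T, \eta) \le C_6\, \eta^{-\tau}\, \|\sqrt{\det(d^2 f)}\|_{L^\tau(T)}^\tau, \qquad (4.46)$$

where $C_6$ is an absolute constant.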
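Proof: The first estimate is trivial since

$$\|f - f_\eta\|_{L^p(T)} = \Big(\sum_{T' \in \mathcal T_\eta(T)} e_{T'}(f)_p^p\Big)^{\frac1p} \le \Big(\sum_{T' \in \mathcal T_\eta(T)} \eta^p\Big)^{\frac1p} = \eta\, N(T, \eta)^{\frac1p}.$$

In the case $p = \infty$, we trivially have

$$\|f - f_\eta\|_{L^\infty(T)} \le \eta.$$

For the second estimate, we define $n_0 = n_0(T)$ the smallest positive integer such that $2^{3 n_0(T)} \ge 8\, \sigma_{\mathbf q}(T)^\lambda$
with $\lambda := \frac{r_0 \ln 8}{-\ln \gamma(r_0)}$. For any fixed $n \ge n_0$, we define

$$\eta_n := \min_{T' \in \mathcal T_n^u(T)} e_{T'}(f)_p.$$

We know from Proposition 4.8 that with the choice $\eta = \eta_n$

$$N(T, \eta_n) \le C_5\, 2^{3n}. \qquad (4.47)$$

On the other hand, we know from Proposition 4.7 that $\bar\sigma_{\mathbf q}^{r_0}(n) \le C_3$, from which it follows that

$$\min_{T' \in \mathcal T_n^u(T)} \sigma_{\mathbf q}(T') \le C_3^{\frac{1}{r_0}}.$$

According to Proposition 4.2, we also have

$$\eta_n \le C \Big(\frac{|T|}{2^{3n}}\Big)^{\frac1\tau} \min_{T' \in \mathcal T_n^u(T)} \sigma_{\mathbf q}(T') \sqrt{\det \mathbf q} \le C\, C_3^{\frac{1}{r_0}} \Big(\frac{|T|}{2^{3n}}\Big)^{\frac1\tau} \sqrt{\det \mathbf q},$$

where $C$ is an absolute constant, which also reads

$$2^{3n} \le C_3^{\frac{\tau}{r_0}}\, C^\tau\, \eta_n^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau}.$$

Combining this with (4.47), we have obtained the estimate

$$N(T, \eta_n) \le C_5\, C_3^{\frac{\tau}{r_0}}\, C^\tau\, \eta_n^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau},$$

which by Proposition 4.2 is equivalent to (4.46) with $\eta = \eta_n$. In order to obtain (4.46) for all
values of $\eta$, we write that $\eta_{n+1} \le \eta \le \eta_n$ for some $n \ge n_0$, then

$$\begin{aligned}
N(T, \eta) &\le N(T, \eta_{n+1}) \\
&\le C_5\, 2^{3(n+1)} \\
&\le 8\, C_5\, C_3^{\frac{\tau}{r_0}}\, C^\tau\, \eta_n^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau} \\
&\le 8\, C_5\, C_3^{\frac{\tau}{r_0}}\, C^\tau\, \eta^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau},
\end{aligned}$$

which by Proposition 4.2 is equivalent to (4.46). In the case where $\eta \ge \eta_{n_0}$, we simply write

$$\begin{aligned}
N(T, \eta) &\le N(T, \eta_{n_0}) \\
&\le C_5\, 2^{3 n_0} \\
&\le 64\, C_5\, \sigma_{\mathbf q}(T)^\lambda \\
&= 64\, C_5\, \eta_0^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau} \\
&\le 64\, C_5\, \eta^{-\tau}\, |T|\, \sqrt{\det \mathbf q}^{\,\tau},
\end{aligned}$$

and we conclude in the same way. $\diamond$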



We remark that combining the estimates (4.45) and (4.46) in the above theorem yields the optimal
local convergence estimate

$$\|f - f_\eta\|_{L^p(T)} \le C_7\, \|\sqrt{\det(d^2 f)}\|_{L^\tau(T)}\, N(T, \eta)^{-1}, \quad\text{with } C_7 := C_6^{\frac1\tau}.$$
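The way (4.45) and (4.46) combine can be written out in two lines. The derivation below is a sketch that reads (4.46) with the norm raised to the power $\tau$ and uses only the relation $\frac1\tau = \frac1p + 1$; the shorthand $A$ is introduced here for readability.

```latex
% Shorthand: A := \|\sqrt{\det(d^2 f)}\|_{L^\tau(T)}, \quad N := N(T,\eta).
\begin{align*}
N \le C_6\, \eta^{-\tau} A^{\tau}
  &\;\Longrightarrow\; \eta \le C_6^{1/\tau}\, A\, N^{-1/\tau}, \\
\|f - f_\eta\|_{L^p(T)} \le \eta\, N^{1/p}
  &\le C_6^{1/\tau}\, A\, N^{\frac1p - \frac1\tau}
   = C_6^{1/\tau}\, A\, N^{-1},
\end{align*}
% since 1/p - 1/\tau = 1/p - (1/p + 1) = -1.
```

The exponent bookkeeping shows why the rate is exactly $N^{-1}$ for every $p$: the gain $N^{-1/\tau}$ from the error threshold always exceeds the loss $N^{1/p}$ from summing local errors by one full power of $N$.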

In order to obtain the global estimate of Theorem 4.1, we need to be ensured that after sufficiently many 
steps of the greedy algorithm, the target / can be well approximated by quadratic function q = q{T) on 
each triangle T, so that our local results will apply on such triangles. This is ensured due to the following 
key result. 

Proposition 4.10 Let $f$ be a $C^2$ function such that $d^2 f(x) \ge m I$ for some arbitrary but fixed $m > 0$
independent of $x$. Let $\mathcal T_N$ be the triangulation generated by the greedy algorithm applied to $f$ using the
$L^1$ decision function given by (2.18). Then

$$\lim_{N \to +\infty}\ \max_{T \in \mathcal T_N} \operatorname{diam}(T) = 0,$$

i.e. the diameter of all triangles tends to 0.

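Proof: Let $T$ be a triangle with an angle $\theta$ at a vertex $z_0$. The other vertices of $T$ can be written as
$z_1 = z_0 + \alpha u$ and $z_2 = z_0 + \beta v$ where $\alpha, \beta \in \mathbb R_+$ and $u, v \in \mathbb R^2$ are unitary. We assume that $\alpha u$ is the
longest edge of $T$, hence $\theta \le \pi/2$. Observe that

$$\rho(T) = \frac{\alpha^2}{|T|} = \frac{\alpha^2}{\frac12 \alpha \beta \sin\theta} = \frac{2 \alpha}{\beta \sin\theta}$$

and

$$|u - v| = 2 \sin\Big(\frac\theta2\Big) = \frac{\sin\theta}{\cos(\frac\theta2)} = \frac{2 \alpha}{\beta\, \rho(T) \cos(\frac\theta2)}.$$

Since $\frac{1}{\sqrt2} \le \cos(\frac\theta2)$ we thus obtain

$$|u - v| \le \frac{2 \sqrt2\, \alpha}{\beta\, \rho(T)} \le \frac{3 \alpha}{\beta\, \rho(T)}.$$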
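We now set $M := \|d^2 f\|_{L^\infty(\Omega)}$ and for all $\delta > 0$ let

$$\omega(\delta) := \sup_{z, z' \in \Omega,\ \|z - z'\| \le \delta} \|d^2 f(z) - d^2 f(z')\|.$$

For $t \in \mathbb R$, we define

$$H_t^u := d^2 f(z_0 + t u) \quad\text{and}\quad H_t^v := d^2 f(z_0 + t v)$$

and notice that $\|H_t^u - H_t^v\| \le \omega(t\, |u - v|)$. Hence, if $0 \le t \le \beta$, we have

$$\|H_t^u - H_t^v\| \le \omega\Big(\frac{3 \alpha}{\rho(T)}\Big).$$

Furthermore, for all $t$ we have

$$\begin{aligned}
|\langle H_t^u u, u\rangle - \langle H_t^u v, v\rangle| &= |\langle H_t^u u, u\rangle - \langle H_t^u (u - (u - v)), u - (u - v)\rangle| \\
&= |2 \langle H_t^u u, u - v\rangle - \langle H_t^u (u - v), u - v\rangle| \\
&\le 2 M |u|\, |u - v| + M |u - v|^2 \\
&\le M \Big(\frac{6 \alpha}{\beta\, \rho(T)} + \Big(\frac{3 \alpha}{\beta\, \rho(T)}\Big)^2\Big).
\end{aligned}$$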

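Applying the identity (2.20) to the edges $e = \alpha u$ and $\beta v$, and using a change of variable, we can write

$$D_T(\alpha u, f) = \int_{\mathbb R} \min\{t, \alpha - t\}_+ \langle H_t^u u, u\rangle\, dt \quad\text{and}\quad D_T(\beta v, f) = \int_{\mathbb R} \min\{t, \beta - t\}_+ \langle H_t^v v, v\rangle\, dt$$

where we have used the notation $r_+ := \max\{r, 0\}$. Hence, noticing that

$$\int_{\mathbb R} \min\{t, \lambda - t\}_+\, dt = \int_0^\lambda \min\{t, \lambda - t\}\, dt = \frac{\lambda^2}{4}$$

and using the previous estimates we obtain

$$\begin{aligned}
D_T(\alpha u, f) - D_T(\beta v, f) &= \int_{\mathbb R} \big(\min\{t, \alpha - t\}_+ \langle H_t^u u, u\rangle - \min\{t, \beta - t\}_+ \langle H_t^v v, v\rangle\big)\, dt \\
&= \int_{\mathbb R} \big(\min\{t, \alpha - t\}_+ - \min\{t, \beta - t\}_+\big) \langle H_t^u u, u\rangle\, dt \\
&\quad - \int_{\mathbb R} \min\{t, \beta - t\}_+ \big(\langle H_t^v v, v\rangle - \langle H_t^u u, u\rangle\big)\, dt \\
&\ge m \int_{\mathbb R} \big(\min\{t, \alpha - t\}_+ - \min\{t, \beta - t\}_+\big)\, dt \\
&\quad - \int_0^\beta \min\{t, \beta - t\} \big(|\langle H_t^u u, u\rangle - \langle H_t^u v, v\rangle| + |\langle (H_t^u - H_t^v) v, v\rangle|\big)\, dt \\
&\ge \frac m4 (\alpha^2 - \beta^2) - \frac{\beta^2}{4} \Big(M \Big(\frac{6 \alpha}{\beta\, \rho(T)} + \Big(\frac{3 \alpha}{\beta\, \rho(T)}\Big)^2\Big) + \omega\Big(\frac{3 \alpha}{\rho(T)}\Big)\Big) \\
&\ge \frac m4 (\alpha^2 - \beta^2) - \frac{\alpha^2}{4} \Big(\frac{6 M}{\rho(T)} + \frac{9 M}{\rho(T)^2} + \omega\Big(\frac{3 \alpha}{\rho(T)}\Big)\Big),
\end{aligned}$$

where we have used the fact that $\alpha \ge \beta$ in the last line. We can therefore write

$$D_T(\alpha u, f) - D_T(\beta v, f) \ge \frac m4 \Big(\alpha^2 \Big(1 - \tilde\omega\Big(\frac{1}{\rho(T)}\Big)\Big) - \beta^2\Big), \qquad (4.48)$$

where we have set

$$\tilde\omega(\delta) := \frac1m \big(6 M \delta + 9 M \delta^2 + \omega(3 \operatorname{diam}(\Omega)\, \delta)\big).$$

The inequality (4.48) shows that $d_T(\cdot, f)$ prescribes a $\tilde\omega\big(\frac{1}{\rho(T)}\big)$-near longest edge bisection in the euclidean
metric for any triangle $T$. Indeed if the smaller edge $\beta v$ was selected, we would necessarily have

$$|\beta v|^2 = \beta^2 \ge \Big(1 - \tilde\omega\Big(\frac{1}{\rho(T)}\Big)\Big) \alpha^2 = \Big(1 - \tilde\omega\Big(\frac{1}{\rho(T)}\Big)\Big) |\alpha u|^2.$$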
Notice that $\tilde\omega(\delta) \to 0$ as $\delta \to 0$.

Since $f$ is strictly convex, there does not exist any triangle $T \subset \Omega$ such that $e_T(f)_p = 0$. Let us
assume for contradiction that the diameter of the triangles generated by the greedy algorithm does not
tend to zero. Then there exists a sequence $(T_i)_{i \ge 0}$ of triangles such that $T_{i+1}$ is one of the children of
$T_i$, and $h_{T_i} \to d > 0$ as $i \to \infty$, where $h_T$ denotes the diameter of a triangle $T$. Since $|T_i| \to 0$, this also
implies that p{Ti) — )• +oo as i — )• oo. We can therefore choose i large enough such that h"^. < and 

^^'•^( p(Tj) ) — \ ^^'^ ^ J — ^' where C2 is the constant in Proposition 4.6. According to this Proposition, 
we have 

a(T,+3) < ^a(T,), 

where a stands for CTq in the euclidean case q = + y^. On the other hand, we have for any triangle T, 
g|y|- < o{T) < 2|y|-, from which it follows that 

Therefore, < which is a contradiction. This concludes the proof of Proposition 4.10. o 

Proof of Theorem 4.1 Since $f \in C^2$, an immediate consequence of Proposition 4.10 is that for all
$\mu > 0$, there exists

$$N_1 := N_1(f, \mu),$$

such that for all $T \in \mathcal T_{N_1}$, there exists a quadratic function $q_T$ such that

$$d^2 q_T \le d^2 f \le (1 + \mu)\, d^2 q_T.$$

Therefore our local results apply on all $T \in \mathcal T_{N_1}$. Specifically, we choose

with C2 the constant in Theorem 4.9. We then take 



We use the notations 

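$$f_\eta = f_N, \quad \mathcal T_\eta = \mathcal T_N, \quad N = N(\eta) = \#(\mathcal T_N),$$

for the approximants and triangulation obtained by the greedy algorithm with stopping criterion given
by the local error $\eta$. Note that $\mathcal T_N$ is a refinement of $\mathcal T_{N_1}$, since $\eta \le \min_{T \in \mathcal T_{N_1}} e_T(f)_p$, and therefore
$N \ge N_1$. We obviously have

$$\|f - f_N\|_{L^p} \le \eta\, N^{\frac1p}.$$

Using Theorem 4.9, we also have

$$N = \sum_{T \in \mathcal T_{N_1}} N(T, \eta) \le C_6\, \eta^{-\tau}\, \|\sqrt{\det(d^2 f)}\|_{L^\tau(\Omega)}^\tau,$$

and therefore

$$\|f - f_N\|_{L^p} \le C_7\, \|\sqrt{\det(d^2 f)}\|_{L^\tau(\Omega)}\, N^{-1},$$

which is the claimed estimate. Since we have assumed $\eta \le \eta_0$, this estimate holds for

$$N \ge N_0,$$

where $N_0$ is the largest value of $N$ such that $e_T(f)_p \ge \eta_0$ for at least one $T \in \mathcal T_N$. $\diamond$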



Remark 4.11 In [8] a modification of the algorithm is proposed so that its convergence in the $L^p$ norm
is ensured for any function $f \in L^p$ (or $f \in C(\Omega)$ when $p = \infty$). However this modification is not
needed in the proof of Theorem 4.1, due to the assumption that $f$ is convex.